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APPLICANT(S) FOR DO/EO/US Luet L. Wong, et al. 



Applicant herewith submits to the United States Designated/Lic~cted Office (DO/EO/US) the following items and other information: 

1 . This is a FIRST submission of items concerning a filing under 35 U.S.C. 371 

2. | [ This is a SECOND or SUBSEQUENT submission of items concerning a filing 35 U S.C 371 
3 I I This is an express request to begin national examination procedures (35 U.S.C. 371 (f)). The submission must 

* I ' include items (5), (6), (9) and (21) indicated below. 

4. j^j The US has been elected by the expiration of 1 9 months fiom the priority date (PCT Article 31). 

5. | x | A copy of the International Application as filed (35 U.S.C. 371 (c)(2)) 

a. | x | is attached hereto (required only if not communicated by the International Bureau). 0 

b. | | has been communicated by the International Bureau. 

c. is not required, as the application was filed in the United States Receiving Office (RO/US). 

6. [^j An English language translation of the International Application as tiled (35 U.S C. 3 71 (c)(2)) 

a. | | is attached hereto. 

b. | | has been previously submitted under 35 U S.C. 154(d)(4). 

7. |~^J Amendments to the claims of the International Application under PCT Article 19 (35 U S.C. 371 (c)(3)) 

a. | x | ar e attached hereto (required only if not communicated by the International Bureau). 

b. | | have been communicated by the International Bureau. 

c. | | have not been made; however, the time limit for making such amendments has NOT expired. 

d. [ | have not been made and will not be made. 

8. j^J An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371 (c)(3)). 

9. | | An oath or declaration of the inventor(s) (35 U S C. 371 (c)(4)). 

0 I I An English language translation of the annexes to the International Preliminary Examination Report under PCT 

' I 1 Article 36 (35 U.S.C. 371 (c)(5)). 

ems 11 to 20 below concern docunient(s) or information included: 

1 . | | An Information Disclosure Statement under 37 CFR 1 .97 and 1 .98. 

2. [ | An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included. 

3. QTJ A FIRST preliminary amendment. 

4. Q^j A SECOND or SUBSEQUENT preliminary amendment. 

5. | [ A substitute specification. 

6. [ | A change of power of attorney and/or address letter. 

7. | x | A computer-readable form of the sequence listing in accordance with PCT Rule I3ter.2 and 35 U.S.C 1 .821 - 1.825 

8. | | A second copy of the published international application under 35 U.S.C. 154(d)(4). 

9. | | A second copy of the English language translation of the international application under 35 U.S.C. 154(d)(4). 

20. 1 x I Other items or information: Power of Attorney 
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| x | The following fees are submitted: 

BASIC NATIONAL FEE (37 CFR 1.492 (a) (1) - (5)): 

□ Neither international preliminary examination fee (37 CFR 1 482) 
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO $1040.00 

□ International preliminary examination fee (37 CFR 1 482) not paid to 
USPTO but International Search Report prepared by the EPO or JPO . $890.00 

□ International preliminary examination fee (37 CFR 1.482) not paid to USPTO 
but international search fee (37 CFR 1 445(a)(2)) paid to USPTO . . . $740.00 

□ International preliminary examination fee (37 CFR 1.482) paid to USPTO 
but all claims did not satisfy provisions of PCT^Article 33(l)-(4) $710.00 

□ International preliminary examination fee (37 CFR 1 482) paid to USPTO 
and all claims satisfied provisions of PCT Article 33(l)-(4) $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 

Surcharge of $ for furnishing the oath or declaration later than 

| | 20 Q 30 months from the earliest claimed priority date (37 CFR 1 .492 (e)). 



CALCULATIONS pto use only 



890.00 



CLAIMS 



NUMBER FILED NUMBER EXTRA 



RATE 



Total claims 



18-20 



0.00 



Independent claims 



5-5 = 



0.00 



MULTIPLE DEPENDENT CLAIM(s) (if applicable) 



TOTAL OF ABOVE CALCULATIONS - 



890.00 



□ 



Applicant claims small entity status. See 37 CFR 1 .27. The fees indicated above 
are reduced by 'A 



SUBTOTAL = 



890.00 



Processing fee of $ 



for furnishing the English translation later than 



[ 1 20 30 months from the earliest claimed priority date (37 CFR 1 .492 (f)). + 



TOTAL NATIONAL FEE = 



890.00 



Fee for recording the enclosed assignment (37 CFR 1 .21 (h)). Assignment 
must be accompanied by appropriate cover sheet (37 CFR 3.28, 3.3 1 ) 

( per property). 



TOTAL FEES ENCLOSED 



890.00 



Amount to be 
Refunded: 



Charged: 



A check in the amount of $ 



• □ 

b. | x | Please charge my Deposit Account No. 06-2375 



to cover the above fees is enclosed. 



in the amount of $ 



890.00 



to cover the above fees. A duplicate copy of this sheet is enclosed. 

c. jj<J The Commissioner is hereby authorized to charge any additional fees which may be required or credit 

any overpayment to my Deposit Account No. 06-2375 A duplicate copy of this sheet is enclosed. 

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive 
(37 CFR 1.137 (a) or (b)) must be filed and granted to restore the application to pending status. 
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Houston, Texas 77010-3095 
(713) 651-8231 
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IN THE UNITED STATES PATENT A 



In re Application of: 

Luet Lok Wong 
Jonathan Peter Jones 

Serial No.: Unassigned 

Filed: Herewith 

For: PROCESS FOR OXIDISING 
AROMATIC COMPOUNDS 



§ 
§ 



§ Attorney Dkt: HO-P02353US0 

§ 

§ Group Art Unit: Unassigned 

§ 

§ Examiner: Unassigned 

§ 
§ 



PRELIMINARY AMENDMENT 

BOX: PCT 

Commissioner for Patents 
Washington, DC 20231 



Dear Sir: 

This is a Preliminary Amendment to accompany the U.S. National Phase filing under 
35 U.S.C. § 371 of PCT/GB00/02379, international filing date June 19, 2000, with priority 
claim to June 18, 1999 (GB 9914373.7). This Amendment substitutes the claims to be 
examined. 

AMENDMENT 



IN THE CLAIMS: 

Please cancel claims 1-19 of the PCT application and substitute in their place the 
below-presented claims 1-18: 

1. Process for oxidising a halo aromatic substrate which has more than one 
halogen atom, which process comprises oxidising said substrate with a monooxygenase 
enzyme, wherein a ring carbon of the substrate is oxidised. 

2. Process according to claim 1 in which the enzyme comprises a substitution of 
an amino acid in the active site by an amino acid with a less polar side-chain. 

3. Process according to claim 2 in which the enzyme comprises one or more 
other amino acid substitutions in the active site. 
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4. Process according to claim 1 in which the enzyme is: 

(i) P450 cam , or 

(ii) a naturally occurring homologue of (i), or 

(iii) a mutant of (i) or (ii). 

5. Process according to claim 4 in which the enzyme is one in which amino acid 
96 of P450 C am ? or the equivalent amino acid in a homologue, has been changed to an amino 
acid with a less polar side-chain. 

6. Process according to claim 1 in which the halogen is chlorine. 

7. Process according to claim 1 in which the aromatic compound is a benzene or 
biphenyl. 

8. Process for oxidising a halo aromatic substrate, which process comprises 
oxidising said substrate with a monooxygenase enzyme, wherein the substrate is 1, 2- 
dichlorobenzene, 1, 2, 4- trichlorobenzene, 3,3'-dichlorobiphenyl, 2,2\4,5,5'- 
pentachlorobiphenyl, pentachlorobenzene or hexachlorobenzene 

9. Process according to claim 8 in which the enzyme is: 

(i) P450 cam , or 

(ii) a naturally occurring homologue of (i), or 

(iii) a mutant of (i) or (ii). 

10. Process according to claim 1 which is carried out in a cell that expresses: 

(a) a monooxygenase enzyme; 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin. 

11. Process according to claim 10 in which (b) is putidaretoxin reductase or a 
homologue; or a fragment thereof; and/or (c) is putidaretoxin or a homologue; or a fragment 
thereof. 

12. Process according to claim 10 wherein the cell is one in which the enzyme (a) 
does not naturally occur. 

13. Process for oxidising a halo aromatic substrate which has more than one 
halogen atom, which process comprises oxidising said substrate with a monooxygenase 
enzyme, wherein a ring carbon of the substrate is oxidised , the process being carried out in a 
cell that expresses: 

(a) a monooxygenase enzyme; 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin, 

wherein the cell is one which in its naturally occurring form is able to oxidise a halo aromatic 
substrate. 

14. A cell as defined in claim 13. 
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15. A transgenic non-human animal or transgenic plant whose cells are as 

defined in claim 10. 

16. Method of treating a locus contaminated with a halo aromatic substrate 
comprising contacting the locus with: 

(i) a monooxygenase enzyme, or 

(ii) a cell that expresses: 

(a) a monooxygenase enzyme; 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin, or 

(iii) a non-human transgenic animal or transgenic plant whose cells express (a), (b) 

and (c). 

17. Process for selecting a mutant of a monooxygenase enzyme for its ability to 
oxidise a halo aromatic substrate, which process comprises screening a library of 
monooxygenase mutants for their oxidation effect on the substrate. 

18. Process for oxidising a halo aromatic substrate which has more than one 
halogen atom, which process comprises oxidising said substrate with a monooxygenase 
enzyme, wherein a ring carbon of the substrate is oxidised, and wherein the enzyme is one 
that has been selected in a process according to claim 17. 

REMARKS 

Please substitute and examine the above-presented claims for those pending in the 
PCT application. Please begin numbering with the number 1 . The above-presented claims 
represent re-written claims of the PCT application in U.S. format. No new matter has been 
added. 

Sequence Listing 

A Sequence Listing is submitted herewith in computer readable form and as a print 
out. The sequences are identical to those in the PCT application, and the content of the 
computer readable disk is identical to that of the prior out. 

Please charge any fees due or credit any overpayment to the standing account of 
Fulbright & Jaworski L.L.P. No. 06-2375 under order no. P02353US0 on which the 
undersigned is authorized to sign.. 
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Please contact the undersigned attorney for resolution of any questions or issues in 



this matter. 



Respectfully submitted, 



Reg. No. 40,612 
Fulbright & Jaworski L.L.P. 
1301 McKinney, Suite 5100 
Houston, TX 77010-3095 
713.651.5151 
713.651.5246 (facsimile) 

Attorney for Applicants 
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CLAIMS 

1. Process for oxidising a halo aromatic substrate which has more than one 
halogen atom, which process comprises oxidising said substrate with a monooxygenase 
enzyme, wherein a ring carbon of the substrate is oxidised. 

2. Process according to claim 1 in which the enzyme comprises a substitution 
of an amino acid in the active site by an amino acid with a less polar side-chain. 

3. Process according to claim 2 in which the enzyme comprises one or more 
other amino acid substitutions in the active site. 

4. Process according to claim 1 in which the enzyme is: 

(i) P450 cam , or 

(ii) a naturally occurring homologue of (i), or 
(hi) a mutant of (i) or (ii). 

5. Process according to claim 4 in which the enzyme is one in which amino 
acid 96 of P450 cam or the equivalent amino acid in a homologue, has been changed to an 
amino acid with a less polar side-chain. 

6. Process according to claim 1 in which the halogen is chlorine. 

7. Process according to claim 1 in which the aromatic compound is a benzene 
or biphenyl. 

8. Process for oxidising a halo aromatic substrate, which process comprises 
oxidising said substrate with a monooxygenase enzyme, wherein the substrate is 1,2- 
dichlorobenzene, 1, 2, 4- trichlorobenzene, 3,3'-dichlorobiphenyl, 2,2', 4,5,5'- 
pentachlorobiphenyl, pentachlorobenzene or hexachlorobenzene 

9. Process according to claim 8 in which the enzyme is: 

(i) P450 cam , or 

(ii) a naturally occurring homologue of (i), or 
(hi) a mutant of (i) or (ii). 

10. Process according to claim 1 which is carried out in a cell that expresses: 

(a) a monooxygenase enzyme; 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin. 
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1 1. Process according to claim 10 in which (b) is putidaretoxin reductase or a 
homologue; or a fragment thereof; and/ or (c) is putidaretoxin or a hoxnologue; or a 
fragment thereof. 

12. Process according to claim 10 wherein the cell is one in which the enzyme 
(a) does not naturally occur. 

13. Process for oxidising a halo aromatic substrate which has more than one 
halogen atom, which process comprises oxidising said substrate with a monooxygenase 
enzyme, wherein a ring carbon of the substrate is oxidised , the process being carried out 
in a cell that expresses: 

(a) a monooxygenase enzyme; 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin, 

wherein the cell is one which in its naturally occurring form is able to oxidise a halo 
aromatic substrate. 

14. A cell as defined in claim 13. 

15. A transgenic non-human animal or transgenic plant whose cells are as 
defined in claim 10. 

16. Method of treating a locus contaminated with a halo aromatic substrate 
comprising contacting the locus with: 

(i) a monooxygenase enzyme, or 

(ii) a cell that expresses: 

(a) a monooxygenase enzyme; 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin, or 

(hi) a non-human transgenic animal or transgenic plant whose cells express (a), (b) 

and (c). 

17. Process for selecting a mutant of a monooxygenase enzyme for its ability to 
oxidise a halo aromatic substrate, which process comprises screening a library of 
monooxygenase mutants for their oxidation effect on the substrate. 

18. Process for oxidising a halo aromatic substrate which has more than one 
halogen atom, which process comprises oxidising said substrate with a monooxygenase 
enzyme, wherein a ring carbon of the substrate is oxidised, and wherein the enzyme is one 
that has been selected in a process according to claim 17. 
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I hereby certify that this correspondence is being deposited with the U S 
Postal Service as Express Mail, Airbill No.EU098493435US, in an envelope 
addressee! to Box PCT. Commissioner for Patents, Washington, DC 20231, 
on the date shown below * ^ 




Dated April 4, 2002 Signature: /7S Jn*C0~> <*=***0\ 

(Linda Bourg) 



Docket No.: P02353US1 
(PATENT) 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Patent Application of: 
Wong et al. 



Box PCT, DOEO 
Commissioner for Patents 
Washington, DC 20231 

Dear Sir: 

Prior to examination on the merits, please amend the above-identified U.S. patent 
application as follows: 

1. Attorney Docket Number: 

Please note that the attorney docket number has been changed from P02353US0 to: 
P02353US1. Applicants respectfully request that this change be reflected on all 
correspondence from the Office. 

2. In the Specification: 

Please substitute the below amended paragraphs to correct typographical errors, to 
direct SEQ ID NOs or corrections thereof into the specification. 



Application No.: 10/01 8/730 
Filed: December 18, 2001 



Group Art Unit: Not yet assigned 



Int'l Appln.No.: PCT/GB00/02379 
Filed: June 19, 2000 



Examiner: Not yet assigned 



For: PROCESS FOR OXIDISING AROMATIC 
COMPOUNDS 



SECOND PRELIMINARY AMENDMENT 



Application No. 10/018,730 



Docket No. HO-P02353US1 



Paragraph on page 6, at lines 2-7: 

Typically (i) is any allelic variant of P450 ca m of Pseudomonas putida 
(e.g. of the polypeptide sequence shown in SEQ ID No. 2). Typically (ii) is a 
species homologue of (i) which has sequence homology with (i), and is 
typically P450 B m-3 of Bacillus megaterium (e.g. the polypeptide sequence 
shown in SEQ ID No. 4 and the nucelotide sequence is shown in SEQ ID NO: 
3), P450 ler p of Pseudomonas sp, P450 ery F of Saccharopollyspora erythraea, or 
P450 105 Dl (CYP105) of Streptomyces griseus strains. 



Paragraph on page 6, at lines 22-32: 

The discussion below provides examples of the positions at which 
substitutions may be made in P450 ca m. The same substitutions may be made at 
equivalent positions in the homologues. Standard nomenclature is used to 
denote the mutations. The letter of the amino acid present in the natural form 
is followed by the position, followed by the amino acid in the mutant. To 
denote multiple mutations in the same protein each mutation is listed separated 
by hyphens. The mutations discussed below using this nomenclature specify 
the natural amino acid in P450 cam , but it is to be understood that the mutation 
could be made to a homologue which has a different amino acid at the 
equivalent position. Note that the amino acid numbering shown in SEQ ID 
No. 2 for P450 cam does not correspond to the numbering used in the 
description to denote mutations. The numbering in SEQ ID No. 2 is one more 
than the numbering in the description for a particular position. 



Paragraph on page 9, at lines 4-10: 

Alternatively a linker may be present between the components. The 
linker generally comprises amino acids that do not have bulky side chains and 
therefore do not obstruct the folding of the protein subunits. Preferably the 
amino acids in the linker are uncharged. Preferred amino acids in the linker are 
glycine, serine, alanine or threonine. In one embodiment the linker comprises 
the sequence N-Thr-Asp-Gly-Gly-Ser-Ser-Ser-C (SEQ ID NO:6). The linker 
is typically from at least 5 amino acids long, such as at least 10, 30 or 50 or 
more amino acids long. 



Paragraph on page 11, at lines 2-6: 

In one embodiment the three genes encoding the three proteins of the 
P450 ca m system, i.e. camA, camB, and camC are placed in the mobile regions 
of standard transposon vectors and incorporated into the genome of 
Pseudomonas and flavobacteria. Alternatively plasmid vectors for expressing 
these genes may used, in which case the P450 ca tn gene cluster will be extra- 
chromosomal. 
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I 

Paragraph on page 14, at lines 2-18: 

Oligonucleotide-directed site-specific mutagenesis was carried out by 
the Kunkel method (Kunkel, T. A. Proc. Natl Acad. Sci. USA 1985, 82, 488- 
492) using the Bio-Rad Mutagen kit. The recommended procedure is 
summarised as follows. An Ml 3 mpl9 subclone of the camC gene encoding 
P450 cani (SEQ ID NO: 1) was propagated in the E. coli strain CJ236. This 
strain has the duf ung~ phenotype and thus will tolerate the inclusion of uracil 
in place of thymine in DNA molecules. After three cycles of infection, uracil- 
containing single stranded (USS) Ml 3 DNA was readily isolated by phenol 
extraction of mature Ml 3 phage particles excreted into the growth medium. 
The mutagenic oligonucloetide (or oligonucleotides) were phosphorylated 
with T4 polynucleotide kinase and then annealed to the USS template. The 
four nucleotides, DNA polymerase, DNA ligase, ATP and other chemical 
components were added and the second strand was synthesised in vitro. The 
double stranded form thus obtained was transformed into the dut+ ung+ E. 
coli strain MV1190, which should degrade the uracil-containing template 
strand and propagate the mutant strand synthesised in vitro. Plaques were 
picked and phages of possible mutants grown in E. coli strains MV1190 or 
TGI. The single-stranded DNA from these were sequenced to determine 
whether the mutagenesis reaction was successful. The mutagenic efficiency 
was 50 - 80%. 



Paragraph on page 16, at lines 19-22: 

For the camA gene the primer below (SEQ ID NO: 5) was used at the 5* end of the 
gene to introduce the Eco RI cloning site and to change the first codon from GTG to the 
strong start codon ATG. 

5'- GAG ATT AAG AAT TCA TAA ACA CAT GGG AGT GCG TGC CAT ATG AAC 
GCA AAC 

Eco RI RBS \-camA 

Paragraph bridging page 16, line 24 to page 17, line 3: 

At the y end of camA the primer (nucleotide sequence is SEQ ID NO: 
7) was designed such that 1 5 bases are complementary to nucleotide sequence 
of the last five amino acid residues of camA. The stop codon immediately 
after the GCC codon for the last amino acid was removed, and then part of a 
seven amino acid linker (Thr Asp Gly Gly Ser Ser Ser; SEQ ID NO: 6) which 
contained a Bam HI cloning site (GGATCC = Gly Ser) was introduced. The 
coding sequence was thus (amino acid sequence is SEQ ID NO: 8): 

5'- GAA CTG AGT AGT GCC ACT GAC GGA GGA TCC TCA TCG-3 1 
camA - Thr Asp Gly Gly Ser 

\Bam HI | 



Docket No. HO-P02353US1 
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Paragraph on page 17, at lines 5-6: 

The primer sequence shown below (SEQ ID NO: 9) is the reverse 
complement used for PCR: 

5'- CGA TGA GGA TCC TCC GTC AGT GGC ACT ACT CAG TTC-3' 

Paragraph on page 17, at lines 8-13: 

For the camB gene the primer at the 5' end (nucleotide sequence is 
SEQ ID NO: 10; amino acid sequence is SEQ ID NO: 11) incorporated the 
second half of the peptide linker between the reductase and redoxin proteins, 
and the restriction site Bam HI for joining the two amplified genes together. 
5'- TCA TCG GGA TCC TCA TCG ATG TCT AAA GTA GTG TAT-3' 
Gly Ser Ser Ser | - camB 
\Bamm\ Start 

Paragraph on page 17, at lines 14-19: 

At the 3' end of camB the primer incorporates 12 nucleotides 
complementary to the end of camB followed by the stop codon TAA, a 6 
nucleotide spacer before the GGAG ribosome binding site. Xba I and Hind III 
sites were then added to allow cloning of the camC gene when required. The 
sequence of the coding strand was therefore (SEQ ID NO: 12): 

5'- CCC GAT AGG CAA TGG TAA TCA TCG GGAG TCT AGA GCA TCG AAG CTT 

TCA TCG-3' 

CamB ->| stop RBSJ^al Hind 111 

Paragraph on page 17, at lines 20-21: 

The primer shown below (SEQ ID NO: 13) is the reverse complement 
used for PCR: 

5'-CGA TGA AAG CTT CGA TGC TCT AGA CTCC CGA TGA TTA CCA 
TTG CCT ATC GGG -3' 



Paragraph on page 19, at lines 2-6: 

We used the internal and unique restriction site Mlu I (recognition 
sequence ACGCGT) within the camB gene as the starting point so that the 
PCR product has a different size from the PCR template fragment. The 
primers were as follows: 

5'- TCA TCG ACG CGT CGC GAA CTG CTG-3' (SEQ ID NO: 14) 
where the Mlu I site is in bold. 
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Paragraph on page 19, at lines 7-10: 

The desired coding sequence at the 3' end of the camB gene (SEQ ID 
NO: 15) was: 

5'- CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC 
camB - 1 stop 

TAT GCG CGA GTG GAG TCT AGA GTT CGA-3' 
RBS Xba I 



Paragraph bridging page 20, lines 25 to page 21, line 7: 

The primers shown below maintain the Hind III cloning site AAGCTT: 

5'-TCA TCG AAG CTT GGC TGT TTT-3 ? (SEQ ID NO: 16) 
Hind III |- vector 

At the other end the coding sequence desired was (SEQ ID NO: 17): 

5'- AC A ATT TCA CAC AGGA TCT AGA C CAT ATG TCA TCG AAG CTT 
TCA TCG-3' 

Vector - | RBS Xba I Nde I Hind III 

This sequence maintained the Nde I and Hind III sites but the new Xba I site 
was introduced upstream of the Nde I site. The PCR primer used was the 
reverse complement of the desired sequence (SEQ ID NO: 18): 

5'-CGA TGA AAG CTT CGA TGA CAT ATG GTC T AGA TCCT GTG 
TGA AAT TGT-3' 



25152536.1 



A,0-0:i e 73j 



Application No. 10/018,730 



Docket No. HO-P02353US1 



REMARKS 



The Action, mailed February 4, 2002, requires that a substitute Sequence Listing that 
is in compliance with 37 C.F.R. §§ 1.822 and/or 1.823 be provided. In response, Applicants 
have provided herewith a substitute copy of the corrected Sequence Listing in computer 
readable form and in paper form under 37 C.F.R. § 1.825, respectively. Also, as required 
under 37 C.F.R. § 1.821(d), amendments to the specification directing reference to the 
sequence by use of the sequence identifier "SEQ ID NO:" in the text of the description have 
been provided by way of this amendment under 37 C.F.R. § 1.121. Applicants assert that no 
new matter has been added. Applicants assert that the amendments made herein are directed 
to typographical errors, or amendments of the specification to clearly identify sequences 
referenced in the text of the description to the corrected Sequence Listing, filed herewith. 



Applicants have amended the specification to clarify the sequences referenced in the 
specification to the sequences given in the Sequence Listing. Therefore, these amendments 
do not narrow the scope of the claims within the meaning of Festo Corp. v. Shoketsu Kinzoku 
Kogyo Kabushiki Co., Ltd., 234 F.3d 558, 586, 56 USPQ2d 1865, 1886 (Fed. Cir. 2000). 



CONCLUSION 



Dated: April 4, 2002 



Respectfully submitted, 




Elizabeth A. Hart, Registered Patent Agent 



Registration No.: 50,931 
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Houston, Texas 77010-3095 
(713) 651-5698 
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Appendix A 

Version With Markings to Show Changes Made to Specification 



Paragraph on page 6, at lines 2-7: 

Typically (i) is any allelic variant of P450 cam of Pseudomonas putida 
(e.g. of the polypeptide sequence shown in SEQ ID No. 2 [1]). Typically (ii) 
is a species homologue of (i) which has sequence homology with (i), and is 
typically P450 B m-3 of Bacillus megaterium (e.g. the polypeptide sequence 
shown in SEQ ID No. 4J2] and the nucelotide sequence is shown in SEQ ID 
NO: 3 ), P450 ter p of Pseudomonas sp y P450 ery F of Saccharopollyspora 
erythraea, or P450 105 Dl (CYP105) of Streptomyces griseus strains. 



Paragraph on page 6, at lines 22-32: 

The discussion below provides examples of the positions at which 
substitutions may be made in P450 cam . The same substitutions may be made at 
equivalent positions in the homologues. Standard nomenclature is used to 
denote the mutations. The letter of the amino acid present in the natural form 
is followed by the position, followed by the amino acid in the mutant. To 
denote multiple mutations in the same protein each mutation is listed separated 
by hyphens. The mutations discussed below using this nomenclature specify 
the natural amino acid in P450 cam , but it is to be understood that the mutation 
could be made to a homologue which has a different amino acid at the 
equivalent position. Note that the amino acid numbering shown in SEQ ID 
No. 2 [1] for P450 ca m does not correspond to the numbering used in the 
description to denote mutations. The numbering in SEQ ID No. 2 [1] is one 
more than the numbering in the description for a particular position. 

Paragraph on page 9, at lines 4-10: 

Alternatively a linker may be present between the components. The 
linker generally comprises amino acids that do not have bulky side chains and 
therefore do not obstruct the folding of the protein subunits. Preferably the 
amino acids in the linker are uncharged. Preferred amino acids in the linker are 
glycine, serine, alanine or threonine. In one embodiment the linker comprises - 
the sequence N-Thr-Asp-Gly-Gly-Ser-Ser-Ser-C (SEQ ID NO:6) . The linker 
is typically from at least 5 amino acids long, such as at least 10, 30 or 50 or 
more amino acids long. 
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Paragraph on page 11, at lines 2-6: 

In one embodiment the three genes encoding the three proteins 
[protiens] of the P450 carn system, i.e. camA, camB, and camC are placed in the 
mobile regions of standard transposon vectors and incorporated into the 
genome of Pseudomonas and flavobacteria. Alternatively plasmid vectors for 
expressing these genes may used, in which case the P450 cam gene cluster will 
be extra-chromosomal. 

Paragraph on page 14, at lines 2-18: 

Oligonucleotide-directed site-specific mutagenesis was carried out by 
the Kunkel method (Kunkel, T. A. Proc. Natl. Acad. ScL USA 1985, 82, 488- 
492) using the Bio-Rad Mutagen kit. The recommended procedure is 
summarised as follows. An Ml 3 mpl9 subclone of the camC gene encoding 
P450 cam (SEQ ID NO: 1) was propagated in the E. coli strain CJ236. This 
strain has the dut ung~ phenotype and thus will tolerate the inclusion of uracil 
in place of thymine in DNA molecules. After three cycles of infection, uracil- 
containing single stranded (USS) Ml 3 DNA was readily isolated by phenol 
extraction of mature Ml 3 phage particles excreted into the growth medium. 
The mutagenic oligonucloetide (or oligonucleotides) were phosphorylated 
with T4 polynucleotide kinase and then annealed to the USS template. The 
four nucleotides, DNA polymerase, DNA ligase, ATP and other chemical 
components were added and the second strand was synthesised in vitro. The 
double stranded form thus obtained was transformed into the dut+ ung+ E. 
coli strain MV1190, which should degrade the uracil-containing template 
strand and propagate the mutant strand synthesised in vitro. Plaques were 
picked and phages of possible mutants grown in E. coli strains MV1190 or 
TGI. The single- stranded DNA from these were sequenced to determine 
whether the mutagenesis reaction was successful. The mutagenic efficiency 
was 50 - 80%. 



Paragraph on page 16, at lines 19-22: 

For the cam A gene the primer below (SEQ ID NO: 5) was used at the 5' end of the 
gene to introduce the Eco RI cloning site and to change the first codon from GTG to the 
strong start codon ATG. 

5*- GAG ATT AAG AAT TCA TAA ACA CAT GGG AGT GCG TGC CAT ATG AAC 

GCA AAC 



Eco RI 



RBS 



\-*camA 



Paragraph bridging page 16, line 24 to page 17, line 3: 



At the 3' end of camA the primer (nucleotide sequence is SEQ ID NO: 
7} was designed such that 1 5 bases are complementary to nucleotide sequence 
of the last five amino acid residues of camA. The stop codon immediately 
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after the GCC codon for the last amino acid was removed, and then part of a 
seven amino acid linker (Thr Asp Gly Gly Ser Ser Se r; SEQ ID NO: 6 ) which 
contained a Bam HI cloning site (GGATCC = Gly Ser) was introduced. The 
coding sequence was thus (amino acid sequence is SEQ ID NO: 8) : 

5 f - GAA CTG AGT AGT GCC ACT GAC GGA GGA TCC TCA TCG-3' 
camA - Thr Asp Gly Gly Ser 

\BamBI\ 



Paragraph on page 17, at lines 5-6: 

The primer sequence shown below (SEQ ID NO: 9) is the reverse 
complement used for PCR: 

5'- CGA TGA GGA TCC TCC GTC AGT GGC ACT ACT CAG TTC-3 1 



Paragraph on page 17, at lines 8-13: 

For the camB gene the primer at the 5* end (nucleotide sequence is 
SEQ ID NO: 10; amino acid sequence is SEQ ID NO: 11) incorporated the 
second half of the peptide linker between the reductase and redoxin proteins, 
and the restriction site Bam HI for joining the two amplified genes together. 
5'- TCA TCG GGA TCC TCA TCG ATG TCT AAA GTA GTG TAT-3' 
Gly Ser Ser Ser |- camB 
\Bam HI | Start 

Paragraph on page 17, at lines 14-19: 

At the 3 f end of camB the primer incorporates 12 nucleotides 
complementary to the end of camB followed by the stop codon TAA, a 6 
nucleotide spacer before the GGAG ribosome binding site. Xba I and Hind III 
sites were then added to allow cloning of the camC gene when required. The 
sequence of the coding strand was therefore (SEQ ID NO: 12) : 

5'- CCC GAT AGG CAA TGG TAA TCA TCG GGAG TCT AGA GCA TCG AAG CTT 

TCA TCG-3' 

CamB - 1 stop RBSXbal Hind III 



Paragraph on page 17, at lines 20-21: 

The primer shown below (SEQ ID NO: 13) is the reverse complement 
used for PCR: 

5'-CGA TGA AAG CTT CGA TGC TCT AGA CTCC CGA TGA TTA CCA 
TTG CCT ATC GGG -3' 
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Paragraph on page 19, at lines 2-6: 

We used the internal and unique restriction site Mlu I (recognition 
sequence ACGCGT) within the camB gene as the starting point so that the 
PCR product has a different size from the PCR template fragment. The 
primers were as follows: 

5'- TCA TCG ACG CGT CGC GAA CTG CTG-3' (SEQ ID NO: 14) 
where the Mlu I site is in bold. 



Paragraph on page 19, at lines 7-10: 

The desired coding sequence at the 3 f end of the camB gene (SEQ ID 
NO: 15) was: 

5'- CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC 
camB - 1 stop 

TAT GCG CGA GTG GAG TCT AGA GTT CGA-3' 
RBS Xba I 



Paragraph bridging page 20, lines 25 to page 21, line 7: 



The primers shown below maintain the Hind III cloning site AAGCTT: 

5'-TCA TCG AAG CTT GGC TGT TTT-3 1 (SEQ ID NO: 16) 
Hind 111 |- vector 

At the other end the coding sequence desired was (SEQ ID NO: 17) : 

5'- AC A ATT TCA CAC AGG A TCT AGA C CAT ATG TCA TCG AAG CTT 
TCA TCG-3* 

Vector -> | RBS Xba I Nde I Hind III 

This sequence maintained the Nde I and Hind III sites but the new Xba I site 
was introduced upstream of the Nde I site. The PCR primer used was the 
reverse complement of the desired sequence ( SEQ ID NO: 18) : 

5'-CGA TGA AAG CTT CGA TGA CAT ATG GTC T AGA TCCT GTG 
TGA AAT TGT-3' 
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Appendix B 

Clean Version of Amendments to Specification on April 4, 2002 
Paragraph on page 6, at lines 2-7: 

Typically (i) is any allelic variant of P450 cam of Pseudomonas putida 
(e.g. of the polypeptide sequence shown in SEQ ID No. 2). Typically (ii) is a 
species homologue of (i) which has sequence homology with (i), and is 
typically P450 B m-3 of Bacillus megaterium (e.g. the polypeptide sequence 
shown in SEQ ID No. 4 and the nucelotide sequence is shown in SEQ ID NO: 
3), P450 te rp of Pseudomonas sp, P450 ery F of Saccharopollyspora erythraea, or 
P450 105 Dl (CYP105) of Streptomyces griseus strains. 

Paragraph on page 6, at lines 22-32: 

The discussion below provides examples of the positions at which 
substitutions may be made in P450 cam . The same substitutions may be made at 
equivalent positions in the homologues. Standard nomenclature is used to 
denote the mutations. The letter of the amino acid present in the natural form 
is followed by the position, followed by the amino acid in the mutant. To 
denote multiple mutations in the same protein each mutation is listed separated 
by hyphens. The mutations discussed below using this nomenclature specify 
the natural amino acid in P450 cam , but it is to be understood that the mutation 
could be made to a homologue which has a different amino acid at the 
equivalent position. Note that the amino acid numbering shown in SEQ ID 
No. 2 for P450 C am does not correspond to the numbering used in the 
description to denote mutations. The numbering in SEQ ID No. 2 is one more 
than the numbering in the description for a particular position. 

Paragraph on page 9, at lines 4-10: 

Alternatively a linker may be present between the components. The 
linker generally comprises amino acids that do not have bulky side chains and 
therefore do not obstruct the folding of the protein subunits. Preferably the 
amino acids in the linker are uncharged. Preferred amino acids in the linker are 
glycine, serine, alanine or threonine. In one embodiment the linker comprises 
the sequence N-Thr-Asp-Gly-Gly-Ser-Ser-Ser-C (SEQ ID NO:6). The linker 
is typically from at least 5 amino acids long, such as at least 10, 30 or 50 or 
more amino acids long. 

Paragraph on page 11, at lines 2-6: 

In one embodiment the three genes encoding the three proteins of the 
P450 cam system, i.e. camA, camB, and camC are placed in the mobile regions 
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of standard transposon vectors and incorporated into the genome of 
Pseudomonas and flavobacteria. Alternatively plasmid vectors for expressing 
these genes may used, in which case the P450 cam gene cluster will be extra- 
chromosomal. 

Paragraph on page 14, at lines 2-18: 

Oligonucleotide-directed site-specific mutagenesis was carried out by 
the Kunkel method (Kunkel, T. A. Proc. Natl. Acad. Sci. USA 1985, 82, 488- 
492) using the Bio-Rad Mutagen kit. The recommended procedure is 
summarised as follows. An M13 mpl9 subclone of the camC gene encoding 
P450 cam (SEQ ID NO: 1) was propagated in the E. coli strain CJ236. This 
strain has the dufung~ phenotype and thus will tolerate the inclusion of uracil 
in place of thymine in DNA molecules. After three cycles of infection, uracil- 
containing single stranded (USS) Ml 3 DNA was readily isolated by phenol 
extraction of mature Ml 3 phage particles excreted into the growth medium. 
The mutagenic oligonucloetide (or oligonucleotides) were phosphorylated 
with T4 polynucleotide kinase and then annealed to the USS template. The 
four nucleotides, DNA polymerase, DNA ligase, ATP and other chemical 
components were added and the second strand was synthesised in vitro. The 
double stranded form thus obtained was transformed into the dut+ ung+ E. 
coli strain MV1190, which should degrade the uracil-containing template 
strand and propagate the mutant strand synthesised in vitro. Plaques were 
picked and phages of possible mutants grown in E. coli strains MV1190 or 
TGI. The single-stranded DNA from these were sequenced to determine 
whether the mutagenesis reaction was successful. The mutagenic efficiency 
was 50 - 80%. 



Paragraph on page 16, at lines 19-22: 

For the camA gene the primer below (SEQ ID NO: 5) was used at the 5' end of the 
gene to introduce the Eco RI cloning site and* to change the first codon from GTG to the 
strong start codon ATG. 

5'- GAG ATT AAG AAT TCA TAA ACA CAT GGG AGT GCG TGC CAT ATG AAC 

GC A AAC 

EcoKl RBS \^camA 

Paragraph bridging page 16, line 24 to page 17, line 3: 

At the 3' end of camA the primer (nucleotide sequence is SEQ ID NO: 
7) was designed such that 15 bases are complementary to nucleotide sequence 
of the last five amino acid residues of camA. The stop codon immediately 
after the GCC codon for the last amino acid was removed, and then part of a 
seven amino acid linker (Thr Asp Gly Gly Ser Ser Ser; SEQ ID NO: 6) which 
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contained a Bam HI cloning site (GGATCC = Gly Ser) was introduced. The 
coding sequence was thus (amino acid sequence is SEQ ID NO: 8): 

5'- GAA CTG AGT AGT GCC ACT GAC GGA GGA TCC TCA TCG-3 f 
camA -* Thr Asp Gly Gly Ser 

\Bam HI| 

Paragraph on page 17, at lines 5-6: 

The primer sequence shown below (SEQ ID NO: 9) is the reverse 
complement used for PCR: 

5'- CGA TGA GGA TCC TCC GTC AGT GGC ACT ACT CAG TTC-3' 



Paragraph on page 17, at lines 8-13: 

For the camB gene the primer at the 5' end (nucleotide sequence is 
SEQ ID NO: 10; amino acid sequence is SEQ ID NO: 11) incorporated the 
second half of the peptide linker between the reductase and redoxin proteins, 
and the restriction site Bam HI for joining the two amplified genes together. 
5'- TCA TCG GGA TCC TCA TCG ATG TCT AAA GTA GTG TAT-3' 
Gly Ser Ser Ser | - camB 
\Bamm\ Start 

Paragraph on page 17, at lines 14-19: 

At the 3 ? end of camB the primer incorporates 12 nucleotides 
complementary to the end of camB followed by the stop codon TAA, a 6 
nucleotide spacer before the GGAG ribosome binding site. Xba I and Hind III 
sites were then added to allow cloning of the camC gene when required. The 
sequence of the coding strand was therefore (SEQ ID NO: 12): 

5'- CCC GAT AGG CAA TGG TAA TCA TCG GGAG TCT AGA GCA TCG AAG CTT 

TCA TCG-3' 

CamB - 1 stop RBSXbal Hind III 



Paragraph on page 17, at lines 20-21: 

The primer shown below (SEQ ID NO: 13) is the reverse complement 
used for PCR: 

5'-CGA TGA AAG CTT CGA TGC TCT AGA CTCC CGA TGA TTA CCA 
TTG CCT ATC GGG -3' 



Paragraph on page 19, at lines 2-6: 

We used the internal and unique restriction site Mlu I (recognition 
sequence ACGCGT) within the camB gene as the starting point so that the 



25152536.1 



13 



i o n :i S "7 3 a - O MPOH-O 5 

Application No. 10/018,730 Docket No. HO-P02353US1 



PCR product has a different size from the PCR template fragment. The 
primers were as follows: 

5'- TCA TCG ACG CGT CGC GAA CTG CTG-3' (SEQ ID NO: 14) 
where the Mlu I site is in bold. 



Paragraph on page 19, at lines 7-10: 

The desired coding sequence at the 3' end of the camB gene (SEQ ID 
NO: 15) was: 

5'- CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC 
camB - 1 stop 

TAT GCG CGA GTG GAG TCT AGA GTT CGA-3* 
RBS Xba I 



Paragraph bridging page 20, lines 25 to page 21, line 7: 

The primers shown below maintain the Hind III cloning site AAGCTT: 

5'-TCA TCG AAG CTT GGC TGT TTT-3' (SEQ ID NO: 16) 
Hind III |- vector 

At the other end the coding sequence desired was (SEQ ID NO: 17): 

5 -ACA ATT TCA CAC AGGA TCT AGA C CAT ATG TCA TCG AAG CTT 

TCA TCG-3' 

Vector - | RBS Xba I Nde I Hind III 

This sequence maintained the Nde I and Hind III sites but the new Xba I site 
was introduced upstream of the Nde I site. The PCR primer used was the 
reverse complement of the desired sequence (SEQ ID NO: 18): 

5*-CGA TGA AAG CTT CGA TGA CAT ATG GTC T AGA TCCT GTG 
TGA AAT TGT-3' 



{ 
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PROCESS FOR OXIDISING AROMATIC COMPOUNDS 
The invention relates to a process for enzymatically oxidising halogenated 
aromatic compounds. 

Chlorinated aromatic compounds such as the chlorobenzene and polychlorinated 
5 biphenyls (PCBs) are among the most wide-spread organic contaminants in the 

environment due to their common application as solvents, biocides, and in the heavy 
electrical industry. They are also some of the most problematic environmental pollutant, 
not only because of the health hazards (lipid solubility and hence accumulation in fatty 
tissues, toxicity and carcinogenicity) but also because of their slow degradation in the 
10 environment. 

Whilst microorganisms have shown extraordinary abilities to adapt and evolve to 
degrade most of the organic chemicals released into the environment, the most chemically 
inert compounds such as PCBs do persist for two main reasons. First, these compounds 
have very low solubility in water and therefore their bioavailability is low. Research into 
1 5 this problem has focussed on the use of detergents and other surfactants to enhance their 
solubility and bioavailability. Second, these compounds require activation by enzymatic 
oxidation or reduction, and it can take a long time for the necessary genetic adaptations by 
microorganisms to occur, and even then the organisms may not be stable and viable. 

We have now found, according to the present invention, that a monoxygenase, in 
20 particular P450 cam and its physiological electron transfer partners putidaretoxin and 

putidaretoxin reductase, can be used to oxidise halogenated aromatic compounds. Also 
mutants of the monoxygenase with substitutions in the active site have enhanced oxidation 
activity. Thus suitable monoxygenases can be expressed in microorganisms, animals and 
plants which are going to be used to oxidise the halogenated aromatic compounds. 
25 Accordingly the present invention provides a process for oxidising a substrate 

which is a halo aromatic compound, which process comprises oxidising said substrate with 
a monooxygenase enzyme. 

The process may be carried out in a cell that expresses: 
(a) the enzyme 
30 (b) an electron transfer reductase; and 

(c) an electron transfer redoxin. 
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The halo aromatic compound is typically a benzene or biphenyl compound. The 
benzene ring is optionally fused and can be substituted. The halogen is typically chlorine. 
In many cases there is more than one halogen atom in the molecule, typically 2 to 5 or 6, 
for example 3 . Generally 2 of the halogen atoms will be ortho or para to one another. The 
compound may or may not contain an oxygen atom such as a hydroxy group, an aryloxy 
group or a carboxy group. The compound may or may not be chlorophenol or a 
chlorophenoxyacetic compound. 

Specific compounds which can be oxidised by the process of the present 
invention include 1, 2; 1,3- and 1,4-dichlorobenzene, 1, 2, 4; 1, 2, 3- and 1, 3, 5- 
trichlorobenzene, 1, 2, 4, 5- and 1, 2, 3, 5- tetrachlorobenzene, pentachlorobenzene, 
hexachlorobenzene, 3,3-dichlorobiphenyl and 2, 3, 4, 5, 6- and 2, 2', 4, 5, 5'- 
pentachlorobipheny 1 . 

Other compounds which can be oxidised by the process include recalcitrant halo 
aromatic compounds, especially dioxins and halogenated dibenzofurans, and the 
corresponding compounds where one or both oxygen atoms is/are replaced by sulphur, in 
particular compounds of the formula. 




which possess at least one halo substituent, such as dioxin itself, 
2,3,7,8-tetrachlorodibenzodioxin. 

The oxidation typically gives rise to 1, 2 or more oxidation products. These 
oxidation products will generally comprise 1 or more hydroxyl groups. Generally, 
therefore, the oxidation products are phenols which can readily be degraded. It is 
particularly noteworthy that pentachlorobenzene and hexachlorobenzene can be oxidised in 
this way since they are very difficult to degrade. In contrast the corresponding phenols can 
be readily degraded by a variety of Pseudomonas and other bacteria. The atom which is 
oxidized is generally a ring carbon. 

The enzyme is typically a natural monooxygenase or a mutant thereof. The 
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natural monooxygenase is generally a prokaryotic or eukaryotie enzyme. Typically it is a 
haem-containing enzyme and/or a P450 enzyme. The monooxygenase may or may not be 
a TfdA (2,4-dichlorophenoxy) acetate/a-KG dioxygenase. The monooxygenase is 
generally of microorganism (e.g. bacterial), fungal, yeast, plant or animal origin, typically 
5 of a bacterium of the genus Pseudomonas. These organisms are typically soil, fresh water 
or salt water dwelling. In the case of a mutant monooxygenase the non-mutant form may 
or may not be able to oxidize the substrate. 

The monooxygenase typically has a coupling efficiency of at least 1%, such as at 
least 2%, 4%, 6% or more. The monooxygenase typically has a product formation rate of 

10 at least 5 min* 1 , such as at least 8, 10, 15 , 20, 25, 50, 100, 150 min -1 or more. The coupling 
efficiency or product formation rate is typically measured using any of the substrates or 
conditions mentioned herein. Thus they are typically measured in the in vitro conditions 
described in Example 2, in which case the relevant monooxygenase, reductase and redoxin 
would be present instead of, but at the same concentration as, P450 cam> putidaretoxin 

1 5 reductase and putidaretoxin. 

The mutant typically has at least one mutation in the active site. A preferred 
mutant comprises a substitution of an amino acid in the active site by an amino acid with a 
less polar side chain. Thus the amino acid is typically substituted with an amino acid 
which is above it in Table 1 . 
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Table 1 . HYDROPATHY SCALE FOR AMINO ACID SIDE CHAINS 



Side Chain Hydropathy 
5 





lie 


4.5 




Val 


4.2 


10 


Leu 


3.8 




Phe 


2.8 




Cys 


2.5 




Met 


1.9 




Ala 


1.8 


15 


Gly 


-0.4 




Thr 


-0.7 




Ser 


-0.8 




Trp 


-0.9 




Tyr 


-1.3 


20 


Pro 


-1.6 




His 


-3.2 




Glu 


-3.5 




Gin 


-3.5 




Asp 


-3.5 


25 


Asn 


-3.5 




Lys 


-3.9 




Arg 


-4.5 



An amino acid 'in the active site' is one which lines or defines the site in which 
30 the substrate is bound during catalysis or one which lines or defines a site through which 
the substrate must pass before reaching the catalytic site. Therefore such an amino acid 
typically interacts with the substrate during entry to the catalytic site or during catalysis. 
Such an interaction typically occurs through an electrostatic interaction (between charged 
or polar groups), hydrophobic interaction, hydrogen bonding or van der Waals forces. 
35 The amino acids in the active site can be identified by routine methods to those 

skilled in the art. These methods include labelling studies in which the enzyme is allowed 
to bind a substrate which modifies (Mabels') amino acids which contact the substrate. 
Alternatively the crystal structure of the enzyme with bound substrate can be obtained in 
order to deduce the amino acids in the active site. 



a o o :i a ~r ~s n , o Hfi4- o i? 



WO 00/78973 PCT/GBOO/02379 

The monooxygenase typically has 1, 2, 3, 4 or more other mutations, such as 
substitutions, insertions or deletions. The other mutations may be in the active site or 
outside the active site. Typically the mutations are in the 'second sphere' residues which 
affect or contact the position or orientation of one or more of the amino acids in the active 
5 site. The insertion is typically at the N and/or C terminal and thus the enzyme may be part 
of a fusion protein. The deletion typically comprises the deletion of amino acids which are 
not involved in catalysis, such as those outside the active site. The monooxygenase may 
thus comprise only those amino acids which are required for oxidation activity. 

The other mutations in the active site typically alter the position and/or 

10 conformation of the substrate when it is bound in the active site. The mutation may make 
the site on the substrate which is to be oxidized more accessible to the haem group. Thus 
the mutation may be a substitution to an amino acid which has a smaller or larger, or more 
or less polar, side chain. 

The other mutations typically increase the stability of the protein, or make it 

15 easier to purify the protein. They typically prevent the dimerisation of the protein, typically 
by removing cysteine residues from the protein (e.g. by substitution of cysteine at position 
334 of P450 cam , or at an equivalent position in a homologue, preferably to alanine). They 
typically allow the protein to be prepared in soluble form, for example by the introduction 
of deletions or a poly-histidine tag, or by mutation of the N-terminal membrane anchoring 

20 sequence. The mutations typically inhibit protein oligomerisation, such as oligomerisation 
arising from contacts between hydrophobic patches on protein surfaces. 

Typically the mutant monoxygenase is at least 70% homologous to a natural 
monooxygenase on the basis of amino acid identity. 

Any of the homologous proteins mentioned herein are typically at least 70% 

25 homologous to a protein or at least 80 or 90% and more preferably at least 95%, 97% or 

99% homologous thereto over at least 20, preferably at least 30, for instance at least 40, 60 
or 100 or more contiguous amino acids. The contiguous amino acids may include the 
active site. This homology may alternatively be measured not over contiguous amino acids 
or nucleotides but over only the amino acids in the active site. 

30 The monoxygenase is preferably: 

(i) P450 cam> 

(ii) a naturally occurring homologue of (i), 
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(iii) a mutant of (i) or (ii). 

Typically (i) is any allelic variant of P450 cam of Pseudomonas putida (e.g. of the 
polypeptide sequence shown in SEQ ID No. 1). Typically (ii) is a species homologue of (i) 
which has sequence homology with (i), and is typically P450 BM . 3 of Bacillus megaterium 
5 (e.g. the polypeptide sequence shown in SEQ ED No. 2), P450 lcfp of Pseudomonas sp, 
P450 ciyF of Saccharopollyspora erythraea, or P450 105 Dl (CYP105) of Streptomyces 
griseus strains. 

The active site of (ii) or (iii) may be substantially the same as the active site of (i) 
or any of the mutants of (i) mentioned herein. Thus the site may comprise the same amino 

10 acids in substantially the same positions. 

Typically in (iii) amino acid 96 of P450 cam , or the equivalent amino acid in a 
homologue, has been changed to an amino acid with a less polar side chain. 

The 'equivalent 5 side chain in the homologue is one at the homologous position. 
This can be deduced by lining up the P450 cam sequence and the sequence of the homologue 

1 5 based on the homology between the two sequences. The PUJEUP, BLAST and BESTFIT 
algorithms can be used to line up the sequences (for example as described in Altschul S. F. 
(1993) J Mol Evol 36:290-300; Altschul, S, F etal (1990) J Mol Biol 215:403-10 and 
(Devereux et al (1984) Nucleic Acids Research 12, p3 87-3 95)). These algorithms can also 
be used to calculate the levels of homology discussed herein (for example on their default 

20 settings). The equivalent amino acid will generally be in a similar place in the active site 
of the homologue as amino acid 96 in P450 cam . 

The discussion below provides examples of the positions at which substitutions 
may be made in P450 cam . The same substitutions may be made at equivalent positions in 
the homologues. Standard nomenclature is used to denote the mutations. The letter of the 

25 amino acid present in the natural form is followed by the position, followed by the amino 
acid in the mutant. To denote multiple mutations in the same protein each mutation is 
listed separated by hyphens. The mutations discussed below using this nomenclature 
specify the natural amino acid in P450 cam , but it is to be understood that the mutation could 
be made to a homologue which has a different amino acid at the equivalent position. Note 

30 that the amino acid numbering shown in SEQ ID No. 1 for P450 cam does not correspond to 
the numbering used in the description to denote mutations. The numbering in SEQ ID No. 
1 is one more than the numbering in the description for a particular position. 
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An additional mutation is typically an amino acid substitution at amino acid 87, 
98, 101, 185, 244, 247, 248, 296, 395, 396 or a combination of these, for example as 
shown in table 2. 

The following combinations of substitutions are preferred: 
5 (i) Substitution at position 87 to amino acids of different side-chain volume, 

such as substitutions (typically of F) to A, L, I and W, combined with substitutions at 
position 96 to amino acids of different side-chain volume such as (typically Y to) A, L, F, 
and W. These combinations alter the space available in the upper part of the substrate 
pocket compared to the wild-type enzyme, for example, from Y96W-F87W (little space) to 
10 Y96A-F87A (more space), as well as the location of the space, for example from one side 
in Y96F-F87A to the other in Y96A-F87W. 

(ii) Substitution at position 96 to F combined with substitutions at positions 
1 85 and 395. Both T185 and 1395 are at the upper part of the substrate pocket, and 
substitution with A creates more space while substitution with F will reduce the space 

15 available and push the substrate close to the haem. 

(iii) Substitutions at position 96 to A, L, F, and W combined with substitutions 
at residues closer to the haem including at 101, 244, 247, 295, 296 and 396 to A, L, F, or 
W. These combinations will create or reduce space in the region of the different side- 
chains to offer different binding orientations to substrates of different sizes. For example, 

20 the combinations Y96W-L244A and Y96L-V247W will offer very different pockets for the 
binding of the substrate. 

(iv) Triple substitutions at combinations of positions 87, 96, 244, 247, 295, 
296, 395 and 396 with combinations of A, L, F, and W. The aim is to vary the size and 
shape of the hydrophobic substrate binding pocket. For example, the Y96A-F87A-L244A 

25 combination creates more space compared to the Y96F-F87W-V396L combination, thus 
allowing larger substrates to bind to the former while restricting the available binding 
orientations of smaller substrates in the latter. The combinations Y96F-F87W-V247L and 
Y96F-F87W-V295I have comparable substrate pocket volumes, but the locations of the 
space available for substrate binding are very different. The combination Y96F-F87L- 

30 V247A has a slightly larger side-chain volume at the 96 position than the combination 

Y96L-F87L-V247A, but the L side-chain at the 96 position is much more flexible and the 
substrate binding orientations will be different for the two triple mutants. 
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(v) The mutants with four or five substitutions were designed with similar 
principles of manipulating the substrate volume, the different flexibility of various side- 
chains, and the location of the space available in the substrate pocket for substrate binding 
so as to effect changes in selectivity of substrate oxidation. 
5 Mutations are generally introduced into the enzyme by using methods known in 

the art, such as site directed mutagenesis of the enzyme, PCR and gene shuffling methods 
or by the use of multiple mutagenic oligonucleotides in cycles of site-directed mutagenesis. 
Thus the mutations may be introduced in a directed or random manner. Typically the 
mutagenesis method produces one or more polynucleotides encoding one or more different 

10 mutants. In one embodiment a library of mutant oligonucleotides is produced which can 
be used to produce a library of mutant enzymes. 

The process is typically carried out in the presence of the natural cofactors of the 
monooxygenase. Thus typically in addition to the enzyme (a) and the substrate the process 
is carried out in the presence of an electron transfer reductase (b), an electron transfer 

15 redoxin (c), cofactor for the enzyme and an oxygen donor. In this system the flow of 
electrons is generally: cofactor —►(b)—*' (c) ~> (a). 

(b) is generally an electron transfer reductase which is able to mediate the 
transfer of electrons from the cofactor to (c), such as a naturally occurring reductase or a 
protein which has homology with a naturally occurring reductase, such as at least 70% 

20 homology; or a fragment of the reductase or homologue. (b) is typically a reductase of any 
of the organisms mentioned herein, and is typically a flavin dependent reductase, such as 
putidaredoxin reductase. 

(c) is generally an electron transfer redoxin which is able to mediate the transfer 
of electrons from the cofactor to (a) via (b). (c) is typically a naturally occurring electron 

25 transfer redoxin or a protein which has homology with a naturally occurring electron 
transfer redoxin, such as at least 70% homology; or a fragment of the redoxin or 
homologue. (c) is typically a redoxin of any of the organisms mentioned herein, (c) is 
typically a two-iron/two sulphur redoxin, such as putidaredoxin. 

The cofactor is any compound capable of donating an electron to (b), such as 

30 NADH. The oxygen donor is any compound capable of donating oxygen to (a), such as 
dioxygen. 

Typically (a), (b) and (c) are present as separate proteins; however they may be 
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present in the same fusion protein. Typically only two of them, preferably (b) and (c), are 
present in the fusion protein. Typically these components are contiguous in the fusion 
protein and there is no linker peptide present. 

Alternatively a linker may be present between the components. The linker 
5 generally comprises amino acids that do not have bulky side chains and therefore do not 
obstruct the folding of the protein subunits. Preferably the amino acids in the linker are 
uncharged. Preferred amino acids in the linker are glycine, serine, alanine or threonine. In 
one embodiment the linker comprises the sequence N-Thr-Asp-Gly-Gly-Ser-Ser-Ser-C. 
The linker is typically from at least 5 amino acids long, such as at least 10, 30 or 50 or 
10 more amino acids long. 

In the process the concentration of (a), (b) or (c) is typically from 10* to 
10 2 M, preferably from 10* 6 to lO^M. Typically the ratio of concentrations of (a): (b) 
and/or (a): (c) is from 0.1:01 to 1:10, preferably from 1:0.5 to 1:2, or from 1:0.8 to 1:1.2. 
Generally the process is carried out at a temperature and/or pH at which the enzyme is 
15 functional, such as when the enzyme has at least 20%, 50%, 80% or more of peak activity. 
Typically the pH is from 3 to 1 1, such as 5 to 9 or 6 to 8, preferably 7 to 7.8 or 7.4. 
Typically the temperature is 10 to 90*C, such as 25 to 75°C or 30 to 60°C. 

In the process different monooxygenases may be present. Typically each of 
these will be able to oxidise different substrates, and thus using a mixture of 
20 monooxygenases will enable a wider range of substrates to be oxidised. 

In one embodiment the process is carried out in the presence of a substance able 
to remove hydrogen peroxide by-product (e.g. a catalase). 

In one embodiment the process is carried out in the presence of the enzyme, 
substrate and an oxygen atom donor, such as hydrogen peroxide or t-butylhydroperoxide, 
25 for example using the peroxide shunt. 

In one embodiment in the process the (a), (b) and (c) together are typically in a 
substantially isolated form and/or a substantially purified form, in which case together they 
will generally comprise at least 90%, e.g., at least 95%, 98% or 99% of the protein in the 
preparation. 

30 The process may be carried out inside or outside a cell. The cell is typically in 

culture, at a locus, in vivo or in planta (these aspects are discussed below). 

The process is typically carried out at a locus such as in land (e.g. in soil) or in 
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water (e.g. fresh water or sea water). When it carried out in culture the culture typically 
comprises different types of cells of the invention, for example expressing different 
monooxygenases of the invention. Generally such cells are cultured in the presence of 
assimible carbon and nitrogen sources. 
5 Typically the cell in which the process is carried out is one in which the 

monooxygenase does not naturally occur. In another embodiment the monooxygenase is 
expressed in a cell in which it does naturally occur, but at higher levels than naturally 
occurring levels. The cell may produce 1, 2, 3, 4 or more different monooxygenases of the 
invention. These monoxygenases may be capable of oxidising different halo aromatic 
10 compounds. Typically the cell also expresses any of the reductases and/or redoxins 
discussed above. 

The cell is typically produced by introducing into a cell (i.e. transforming the cell 
with) a vector comprising a polynucleotide that encodes the monooxygenase. The vector 
may integrate into the genome of the cell or remain extrachromosomal. The cell may 

15 develop into the animal or plant discussed below. Typically the coding sequence of the 
polynucleotide is operably linked to a control sequence which is capable of providing for 
the expression of the coding sequence by the host cell. The control sequence is generally a 
promoter, typically of the cell in which the monooxygenase expressed. 

The term "operably linked" refers to a juxtaposition wherein the components 

20 described are in a relationship permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence is ligated in such a way that 
expression of the coding sequence is achieved under conditions compatible with the control 
sequences. 

The vector is typically a transposon, plasmid, virus or phage vector. It typically 
25 comprises an origin of replication. It typically comprises one or more selectable marker 
genes, for example an ampicillin resistance gene in the case of a bacterial plasmid. The 
vector is typically introduced into host cells using conventional techniques including 
calcium phosphate precipitation, DEAE-dextran transfection, or electroporation. 

Components (b) and (c) may be expressed in the cell in a similar manner. 
30 Typically (a), (b) and (c) are expressed from the same vector, or may be expressed from 
different vectors. They may be expressed as three different polypeptides. Alternatively 
they may be expressed in the form of fusion proteins. The cell typically expresses more 
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than one type of monooxygenase. 

In one embodiment the three genes encoding the three protiens of the P450 caro 
system, i.e. camA, camB, and camC are placed in the mobile regions of standard 
transposon vectors and incorporated into the genome of Pseudomonas and flavobacteria. 
5 Alternatively plasmid vectors for expressing these genes may used, in which case the 
P450 cam gene cluster will be extra-chromosomal. 

The cell may be prokaryotic or eukaryotic and is generally any of the cells or of 
any of the organisms mentioned herein. Preferred cells are Pseudomanas, flavobacteria or 
fungi cells (e.g. Aspergillus). In one embodiment the cell is one which in its naturally 
10 occurring form is able to oxidise any of the substrates mentioned herein. Typically the cell 
is in a substantially isolated form and/or substantially purified form, in which case it will 
generally comprise at least 90%, e.g. at least 95%, 98% or 99% of the cells or dry mass of 
the preparation. 

The invention provides a transgenic animal or plant whose cells are any of the 

15 cells of the invention. The animal or plant is transgenic for the monooxygenase gene and 
typically also an appropriate electron transfer reductase and/or redoxin gene. They may be 
homozygote or heterozygote for such genes, which are typically transiently introduced into 
the cells, or stably integrated. (e.g. in the genome). The animal is typically a worm (e.g 
earthworm) or nematode. The plant or animal may be obtained by transforming an 

20 appropriate cell (e.g. embryo stem cell, callus or germ cell), fertilising the cell if required, 

allowing the cell to develop into the animal or plant and breeding the animal or plant true if 
required. The animal or plant may be obtained by sexual or asexual (e.g. cloning) 
propagation of an animal or plant of the invention or of the Fl organism (or any generation 
removed from the Fl, or the chimera that develops from the transformed cell). 

25 As discussed above the process may be carried out at a locus. Thus the invention 

also provides a method of treating a locus contaminated with a halo aromatic compound 
comprising contacting the locus with a monooxygenase, cell, animal or plant of the 
invention. These organisms are then typically allowed to oxidise the halo aromatic 
compound. In one embodiment the organisms used to treat the locus are native to the 

30 locus. Thus they may be obtained from the locus (e.g. after contamination), 

transformed/transfected (as discussed above) to express the monooxygenase (and 
optionally an appropriate electron transfer reductase and/or redoxin. 
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In one embodiment the locus is treated with more than one type of organism of 
the invention, e.g. with 2, 3, 4, or more types which express different monooxygenases 
which oxidise different halo aromatic compounds. In one embodiment such a collection of 
organisms between them is able to oxidise all halobenzenes, e.g. all chlorobenzenes. 
5 The organisms (e.g. in the form of the collection) may carry out the process of 

the invention in a bioreactor (e.g. in which they are present in immobilised form). Thus the 
water or soil to be treated may be passed through such a bioreactor. Soil may be washed 
with water augmented with surfactants or ethanol and then introduced into the bioreactor. 

The invention also provides a process for selecting a mutant of a monooxygenase 

10 for its ability to oxidise any of the substrates mentioned herein, which process comprises 
screening a library of said mutants for their oxidation effect on the substrate. Thus 
typically the substrate is provided to the library and mutants are selected based on their 
ability to oxidise the substrate, for example at a particular rate or under particular 
conditions. The mutant may be selected based on its ability to oxidise the substrate to a 

1 5 particular oxidation product. 

Typically the library will be in the form of cells which comprise the mutant 
enzymes. Generally each cell will express only one particular mutant enzyme. The library 
typically comprises at least 500 mutants, such as at least 1,000 or 5,000 mutants, preferably 
at least 10,000 different mutants. 

20 The library typically comprises a random population of mutants. The library 

may undergo one or more rounds of selection whilst being produced and therefore may not 
comprise a random population. 

The library is typically produced by contacting any of the cells discussed herein 
which expresses the monooxygenase with a mutagen and/or when the cell is a mutator cell 

25 culturing the cell in conditions in which mutants are produced. The mutagen may be 

contacted with the cell prior to or during culturing of the cell. Thus the mutagen may be 
present during replication of the cell or replication of the genome of the cell. 

The mutagen generally causes random mutations in the polynucleotide sequence 
which encodes (a). The mutagen is typically a chemical mutagen, such as 

30 nitrosomethyguanidine, methyl- or ethylmethane sulphonic acid, nitrite, hydroxylamine, 
DNA base analogues, and acridine dyes, such as proflavin. It is typically electromagnetic 
radiation, such as ultra-violet radiation at 260 nm (absorption maximum of DNA) and X- 
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rays. It is typically ionising radiation. 

A mutator cell is generally deficient in one or more of the primary DNA repair 
pathways (such as RColi pathways mutS, mutD or rnutT, or their equivalents in another 
organism), and thus has a high mutation rate. Simply culturing such cell leads to the DNA 
5 encoding (a) to become mutated. The cell may be of RColi XL1 Red mutator strain. 

The mutant selected from the library may be used in any aspect of the invention, 
thus it may be used to oxidise a substrate in the process of the invention or may be 
expressed in the cell, animal or plant of the invention. It may be used in the method of 
treating a locus. 

10 The invention is also illustrated by the Examples: 

Example 1 

Expression of mutants for in vitro work. 

The P450 cam enzymes were expressed using the vector pRH1091 (Baldwin, J.E., 
Blackburn, J.M, Heath, R.J., and Sutherland, J.D. Bioorg, Med Chem, Letts., 1992, 2, 

15 663-668.) which utilised the trc promoter (a fusion of the trp and lac promoters). This 

vector incorporates a strong ribosome binding site (RBS) and the gene to be expressed is 
cloned using an Nde I site on the 5* end of the gene. We used Hind HI as the cloning site at 
the 3' end of the camC gene. The procedure for protein expression is as follows: Cells are 
grown at 30°C until the OD 600nm reaches 1.0 — 1.2, the temperature is increased to 37°C 

20 and camphor added as a 1 M stock in ethanol to a final concetration of 1 mM. The culture 
is allowed to incubate at 37°C for another 6 hours. The P450 cam protein is expressed to high 
levels in the cytoplasm and the cells take on a red to orange-red colour. 

We have also prepared a variant of pRH1091 (by PCR) which has a extra Xba I site 
between the RBS and the Nde I site. This is important because Nde I is not unique in M13, 

25 and this restriction site is also present in the reductase gene as well as the backbone of the 
pGLWl 1 vector used for the in vivo system. Xba I is unique in the poly linker region of 
Ml 3, but absent in the genes of all three proteins in the P450 cam system and in the 
expression vectors. It therefore allows the camC gene to be moved between the mutagenic 
and expression vectors. 
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How the mutants were made. 

Oligonucleotide-directed site-specific mutagenesis was carried out by the Kunkel 
method (Kunkel, T. A. Proc. Natl. Acad Sci. USA 1985 , 82, 488-492) using the Bio-Rad 
Mutagen kit. The recommended procedure is summarised as follows. An M13 mpl9 
5 subclone of the camC gene encoding P450 cam was propagated in the E coli strain CJ236. 
This strain has the rfw/ungphenotype and thus will tolerate the inclusion of uracil in place 
of thymine in DNA molecules. After three cycles of infection, uracil-containing single 
stranded (USS) Ml 3 DNA was readily isolated by phenol extraction of mature M13 phage 
particles excreted into the growth medium. The mutagenic oligonucloetide (or 

10 oligonucleotides) were phosphorylated with T4 polynucleotide kinase and then annealed to 
the USS template. The four nucleotides, DNA polymerase, DNA ligase, ATP and other 
chemical components were added and the second strand was synthesised in vitro. The 
double stranded form thus obtained was transformed into the dut+ ting+ E. coli strain 
MV1 190, which should degrade the uracil-containing template strand and propagate the 

15 mutant strand synthesised in vitro. Plaques were picked and phages of possible mutants 
grown in K coli strains MV1 190 or TGI . The single-stranded DNA from these were 
sequenced to determine whether the mutagenesis reaction was successful. The mutagenic 
efficiency was 50 - 80%. 

The mutant camC gene is excised from the M 13 subclone by restriction digest with 

20 Nde I and Hind III, and the fragment of appropriate size is ligated to the backbone of the 
expression vector prepared by a similar Nde VHind III digest. 

Multiple mutants were prepared either by further mutagenesis, also by the Kunkel 
method, or where the location of the sites in the sequence permits, simple cloning steps. 
There are two unique restriction sites within the camC gene which are absent from the 

25 expression vector. One is Sph I which spans residues 121 - 123, and the other is Sal I which 
spans residues 338 and 339. Therefore, all mutations at, for example, residues 87, 96, 98, 
and 101 are readily combined with mutations at higher number residues by ligating 
appropriate fragments from restriction digests of mutant camC genes with Nde VSph I and 
Sph VHind III and the backbone fragment from a Nde VSph I digest of the expression 

30 vector. Mutations at, for example, 395 and 396 can be similarly incorporated by digests in 
which Sph I is replaced with Sal I. 

The rationale for introducing the unique Xba I site is now clear: many mutants with 
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multiple mutations were prepared by the cloning procedure above. Without thenar I site it 
would be impossible to clone the gene for these multiple mutants from the expression 
vector back into M13 for further rounds of mutagenesis. Of course these problems could be 
overcome by doing mutagenesis by PCR, for example. 

5 Example 2 

Substrate oxidation protocol: in vitro reactions 



Component Final concentration 

P450 oam enzyme 1 jiM 

Putidaredoxin 10 *iM 

10 Putidaredoxin reductase 1 ^M 

Bovine liver catalase 20 ng/ml 

KC1 200 mM 

Substrate Typically 1 mM 

NADH 250 - 400 jiM 



15 * 50 mM Tris-HCl buffer pH 7.4 is added to make up the volume. 

* Temperature controlled at 30°C, optional. 

* The NADH turnover rate could be determined by monitoring the absorbance at 
340 nm with time. 

* Catalase does not catalyse the substrate oxidation reactions but rather it is present 
20 to remove any hydrogen peroxide by-product which could otherwise denature the P450 CMn . 

The method can be increased in scale to, for example, 20 ml total incubation 
volume to allow purification of sufficient products by HPLC for spectroscopic 
characterisation. Fresh substrate (1 mM) and NADH (1-2 mM) are added periodically, 
such as every 20 minutes in a total reaction time of, typically, 3 hours. 

25 Example 3 

The in vivo system 

The in vivo systems were expressed using the vector pGLWl 1, a derivative of the 
plasmid pKK223 (Brosius, J. and Holy, A. Proc. NatL Acad ScL USA t 1984, 81,6929- 
6933). Expression is directed by the tac promoter and the vector incorporates a gene 
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conferring resistance to the antibiotic ampicillin. 

Two systems were constructed. The first one expressed the electron transfer 
proteins putidaredoxin reductase (camA gene) and putidaredoxin (camB gene) as a fusion 
protein with a seven amino acid peptide linker, and the P450 cam enzyme (camC gene) was 
expressed by the same vector but it was not fused to the electron proteins. The second 
system expressed the three proteins as separate entities in the EColi host. Both systems 
were catalytically competent for substrate oxidation in vivo. 

The general strategy was as follows. The genes for the three proteins were cloned 
using Eco RI and Hind III as flanking sites, with Eco RI at the 5' end. For both in vivo 
systems there are restriction sites between the genes, including between the reductase and 
redoxin genes in the fusion construct. These restriction sites were introduced by PCR, as 
detailed below. The first task, however, was to carry out a silent mutation to remove the 
Hind HI site within the camA gene for the reductase. The AAGCTT Hind III recognition 
sequence in the camA gene was changed to AAGCCT, which is a silent mutation because 
GCT and GCC both encode alanine. The gene was completely sequenced to ensure that 
there were no spurious mutations. 

1. The fusion protein system 

l.a Manipulation of the cantA gene by PCR 

For the camA gene the primer below was used at the 5 1 end of the gene to introduce 
the Eco RI cloning site and to change the first codon from GTG to the strong start codon 
ATG, 

5'- GAG ATT AAG AAT TCA TAA ACA CAT GGG AGT GCG TGC CAT ATG AAC GCA AAC 
EcoRl RBS \-camA 

At the 3* end of camA the primer was designed such that 15 bases are 
complementary to nucleotide sequence of the last five amino acid residues of camA. The 
stop codon immediately after the GCC codon for the last amino acid was removed, and 
then part of a seven amino acid linker (Thr Asp Gly Gly Ser Ser Ser) which contained a 
Bam HI cloning site (GGATCC = Gly Ser) was introduced. The coding sequence was 
thus: 
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5'- GAA CTG AGT AGT GCC ACT GAC GGA GGA TCC TCA TCG-3 1 



camA 



- Thr Asp Gly Gly Ser 



l^amfflj 



The primer sequence shown below is the reverse complement used for PGR: 
5'- CGA TGA GGA TCC TCC GTC AGT GGC ACT ACT CAG TTC-3' 

Lb Manipulations of the camB gene by PCR 

For the camB gene the primer at the 5* end incorporated the second half of the 
peptide linker between the reductase and redoxin proteins, and the restriction site Bam HI 
for joining the two amplified genes together. 

5'- TCA TCG GGA TCC TCA TCG ATG TCT AAA GTA GTG TAT-3' 
Gly Ser Ser Ser |- camB 
\BamUl\ Start 

At the 3* end of camB the primer incorporates 12 nucleotides complementary to the 
end of camB followed by the stop codon TAA, a 6 nucleotide spacer before the GGAG 
ribosome binding site. Xba I and Hind III sites were then added to allow cloning of the 
camC gene when required. The sequence of the coding strand was therefore: 
5'- CCC GAT AGG C AA TGG TAA TCA TCG GGAG TCT AGA GCA TCG AAG CTT TCA TCG-3* 



The primer shown below is the reverse complement used for PCR: 

5»-CGA TGA AAG CTT CGA TGC TCT AGA CTCC CGA TGA TTA CCA TTG CCT ATC GGG -3' 

Lc Preparation of the full fusion construct 

The camA and camB genes were amplified by the PCR using the primers described 
above. The new camA was digested with Eco RI and Bam HI, while the new CamB was 
digested with Bam HI and Hind HI. The pGLWl 1 expression vector was digested with 
Eco RI and Hind HI. All three were purified by agarose gel electrophoresis and the three 
gel slices containing the separate fragments were excised from the gel and ligated together, 
and then transformed into EColi DH5a. Successful ligation of all the fragments were 
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confirmed by a series of restriction digestion experiments, especially the presence of the 
new and unique Xba I site. - The entire sequence of the insert from the Eco RI site to the 
Hind HI site was determined to ensure that all the sequences were correct. 

The new plasmid, named pSGB F * was transformed into KColi and expression of the 
5 reductase and redoxin proteins was induced by DPTG. When a purified P450 cam enzyme 
was added to the cell-free extract, substrate oxidation was observed for a variety of 
substrates. 

When the camC gene is cloned into the pSGB F plasmid using the Xba I and Hind 
III restriction sites, the new recombinant plasmid thus generated expresses the reductase 
10 and redoxin as a fusion protein and the P450 cam enzyme as a operate entity both from the 
same mRNA molecule. This in vivo system is catalytically competent for terpene 
oxidation in whole cells. 

2. The in vivo system with the protein expressed separately 

2.a The basic strategy 

1 5 The starting point of the preparation of this in vivo system was the recombinant 

plasmid used to express the camA gene for putidaredoxin reductase. The camA gene was 
cloned into the pGLWl 1 plasmid using the Eco RI and Bam HI restriction sites, with Eco 
RI being at the 5' end of the gene. Conveniently the poly linker region of the pGLWl 1 
vector has a HindlU site downstream of the Bam HI site. The camB gene was therefore 

20 manipulated by PCR such that it can be cloned into pGLWl 1 using the Bam HI and Hind 
ID sites. This new plasmid expresses the reductase and redoxin as separate proteins. 

The camB gene was cloned into pUCl 18 by the Bam HI and Hind TEL cloning sites 
to express putidaredoxin for our general in vitro substrate oxidation work. Therefore, the 
PCR primer at the 3* end of the camB gene was designed to introduce a ribosome binding 

25 site and the Xba I restriction site upstream of the Hind III site so that the camC gene can be 
inserted downstream of camB using the Xba I and Hind TR sites. Therefore the three genes 
were cloned without fusion in the pGLWl 1 expression vector and arranged in the order 5'- 
camA-camB-camC-3', and each gene has its own RBS to initiate protein synthesis. 
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2.6 Manipulations of the camB gene 

We used the internal and unique restriction site Mlu I (recognition sequence 
ACGCGT) within the camB gene as the starting point so that the PCR product has a 
different size from the PCR template fragment. The primers were as follows: 

5'- TCA TCG ACG CGT CGC GAA CTG CTG-3' 

where the Mlu I site is in bold. 

The desired coding sequence at the 3 f end of the camB gene was: 
5'- CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC 
camB - 1 stop 

TAT GCG CGA GTG GAG TCT AGA GTT CGA-3' 
RBS Xba I 

After the stop codon there is a 35 base spacer before the RBS which is used to 
initiate the synthesis of the P450 cam enzyme. The Xba I cloning site is located within the 
spacer between the RBS and the start codon (not in this primer)of the camC gene. The 
PCR primer used was the reverse complement of the sequence above. The PCR was 
carried out and the amplified fragment of the appropriate size was purified by agarose gel 
electrophoresis and the gel slice excised. 

One extra step was necessary to complete the construction of the new plasmid. The 
plasmid for the fusion protein in vivo system was digested with Mlu I and Hind III 
restriction enzymes, purified by agarose gel electrophoresis, and the gel slice for the small 
camB fragment excised. The pUCl 18 plasmid for camB expression was similarly digested, 
and the gel slice for the backbone was excised. By ligating the two fragments together we 
prepared a new pUCl 18-based plasmid which had m Xba I site followed by an Hind m 
site downstream of the stop codon of camB. This new plasmid was digested with the Mlu I 
and Xba I enzymes and the backbone was ligated with the new camB fragment described 
above to generate a plasmid with the following arrangement of the key components: 
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.Jac Promoter. .Bam ¥H..camB gene.. spacer. .RBS. Xba l..HindTSl.. 



2*c Preparation of the in vivo system plasmid 

Once the modified camB with the Xba I and Hind IK restriction sites and 
appropriate spacers were prepared, the /// vivo system was constructed by cloning this into 
the pGLWl 1 -based plasmid used to express the camA gene (reductase protein) using the 
Bam HI and HindTR sites. The new in vivo system vector has the following arrangement 
of the key components: 

.Jac Promoter. .Eco IRI..RBS..caniA gencspacer..Bam HL.RBS.camB 
gene^spacer.RBS. Xba L. Hind HI.. 

This new plasmid, named pSGB\ was transformed into KColi and expression of 
the reductase and redoxin proteins was induced by IPTG. When a purified P450 cam enzyme 
was added to the cell-free extract, substrate oxidation was observed for a variety of 
substrates. 

When the camC gene is cloned into this pSGC + plasmid using the Xba I and Hind 
III restriction sites, the new recombinant plasmid thus generated will express the three 
proteins separately, each under the direction of its own RBS but from the same mRNA 
molecule. Thus constitutes the in vivo system used in the vast majority of our terpene 
oxidation work. 

3. Introduction of an Xba I site into pRH1091 

This is the final step to enable the camC gene to be cloned into the in vivo systems 
by the two cloning sites Xbal and Hind in. The Xba I site was added by PCR of the entire 
pRH1091 plasmid using two primers. The presence of these two sites will also enable 
cloning of the camC gene into M13 since both Xba I and Hind HI are unique in camC and 
M13. 

The primers shown below maintain the Hind EH cloning site AAGCTT: 

5'-TCA TCG AAG CTT GGC TGT TTT-3' 
HindUl |- vector 
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At the other end the coding sequence desired was: 

5'-ACA ATT TCA CAC AGGA TCT AGA C CAT ATG TCA TCG AAG CTT TCA TCG-3' 
Vector - |RBS Xba I Nde I Hindm 

This sequence maintained the Nde I and Hind HI sites but the new Xba I site was 
introduced upstream of the Nde I site. The PCR primer used was the reverse complement 
of the desired sequence: 

5'-CGA TGA AAG CTT CGA TGA CAT ATG GTC T AGA TCCT GTG TGA AAT TGT-3' 

The PCR product was then purified by agarose gel electrophoresis, digested with 
Hind HI and circularised with T4 DNA ligase. Success of the PCR method was indicated 
by the presence of a new and unique Xba I site in plasmid DNA isolated from 
transform ants. 



4, Cloning of camC into the in vivo systems 

All existing camC mutants were cut out of pRH 1091 -based expression plastids with 
Nde I and Hind III. The new vector is similarly cut with the same restriction enzymes and 
the camC gene cloned into this plasmid with T4 DNA ligase. This DNA is transformed 
into KColi JM109 which then may be grown to express P4S0 CJUn . 

The camC gene is excised from the new vector using Xba I and Hind in restriction 
enzymes and cloned into either the in vivo vector systems or M13mpl9 for mutagenesis. 

5. In vivo expression and substrate turnover 

For protein expression, cells are grown in LBamp medium (tryptone 10 g/iitre, 
yeast extract 5 g/litre, NaCl 10 g/litre, 50 jig/ml ampicillin) at 30°C until the OD 600nm 
reaches 1.0-1 .2. IPTG (isopropyl-P-D-thiogalactopyranoside) was added to a final 
concentration of 1 >iM (from a 1 M stock in H 2 0) and the culture was incubated at 30°C 
overnight. 

For simple screening the substrate can be added to culture and the incubation 
continued. However, due to impurities from the culture media the cells were generally 
washed twice with 0.5 vol. of buffer P, (KH 2 PQ 4 6.4 g, K 2 HP0 4 .3H 2 0 25.8 g, H 2 0 to 4 
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litres, pH 7.4) and resuspended in 0.25 vol. oxygen saturated buffer P containing 24 mM 
glucose. Substrate was added to 1 mM and the incubation continued at 30°C. The reaction 
was allowed to run for 24 hours with periodic additions of substrate and glucose. 



Example 4 

5 The oxidation of halo aromatic compounds 



Mutant 


2,3,6- 
Trichlorophenol 


3,4,6- 
Trichlorophenol 


Coupling 
Efficiency 
(%) 


Product 
formation 
rate (min _1) 


Y96F 


75 


25 


18 


22 


Y96A 


77 


23 


14 


33 


Y96H 


54 


46 


3 




F87L-Y96F 


42 


58 


4 


8 


F87A-Y96F 


52 


48 


2 


3 


F87A-Y96-F-V247A 


43 


57 


4 


7 




Mutant 


2,3- 

Dichlorophenol 


3,4- 

Dichlorophenol 


Coupling 
Efficiency 

(%) 


Product 
formation 
rate (min " l) 


Y96A 


94 


6 


6 


19 


Y96F 


91 


9 


4 


8 


Y96A-V247L 


94 


6 


7 


20 


Y96L-V247A 


90 


10 


2 


0.7 


F87L-96F 


96 


4 


3 


5 


C334A 


95 


5 


2 


0.5 



20 All mutants have C334A. Coupling efficiency is the percentage of NADH 

consumed which was utilised for product formation, i.e. a percentage of the theoretical 
maximum efficiency. The product formation rates are given in (nmol product) (nmol 
P450 C4m )* ! (min)" 1 . The relative amount of product formed in each case is shown. 
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1,3- and 1,4-dichlorobenzene, 1, 2, 3- and 1, 3, 5-trichlorobenzene, 1, 2, 4, 5- and 1, 2, 3, 5- 
tetrachlorobenzene, and 2, 3, 4, 5, 6- and 2, 2', 4, 5, 5'- pentachlorobiphenyl were also 
found to be oxidised. 

Wild-type and mutant P450 CJlm enzymes were tested for their ability to oxidise 3,3- 
5 dichlorobiphenyl and 2,2' > 4,5,5 , -pentachlorobiphenyl. Results are shown in terms of NADH 
turnover. Rates are given as nanomol NADH consumed per nanomol P450 CBm enzyme per 
minute. 



P450 c . m enzyme 


3,3'- 


2^2 ^4^5, 5 — 




dichlorobiphenyl 


pentachlorobiphenyl 


Wild-type 


0.4 


not detected 


Y96F 


15 


1 


F87A-Y96F 


845 


165 


F87L-Y96F 


174 


13 


F87W-Y96F 


4 


3 


F87A-Y96F-V247A 


112 


12 


Y96A-V247L 


84 


37 


F87A-Y96F-L244A 


669 


321 


F8 7 A-Y97F-L244A- V247A 


173 


214 



The first product, 4-hydroxy-3 ,3 '-dichlorobiphenyl was identified by the 
characteristic coupling patterns expected in the *H NMR spectrum and by mass 
20 spectroscopy. The further oxidation product, 4 5 4 < -dihydroxy-3 > 3 -dichlorobiphenyl was 
identified by co-elution with an authentic sample, and by UV-vis and mass spectroscopy. 
This product did not constitute more than ca. 10% of the total products in any of the 
mutants tested. 

For the second substrate product was established as 4 t -hydroxy-2,2 , ,4,5,5- 
25 pentachlorobiphenyl by the observation of the parent ion in the mass spectrum, and by 
comparison with literature *H NMR data. 
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P450 c «* mutants 



All mutants optionally contain the base mutation C334A. 
Single mutants: Y96A, Y96F, Y96L, Y96W. 



Double mutants: 



Y96A-F87A 


Y96F-F87A 


Y96F-V295A 


Y96L-F87A 


Y96Ii-A296I* 


Y96A-F87I. 


Y96F-F87I 


Y96F-V295I. 


Y96L-F87I. 


Y96I.-A296F 


Y96A-F87W 


Y96F-F87L 


Y96F-V295I 


Y96I.-F98W 


Y96L-V396A 


Y96A-F98W 


Y96F-F87W 


Y96F-A296I. 


Y96L-T101I, 


Y96I.-V396L 


Y96A-I*244A 


Y96F-F98W 


Y96F-A296F 


Y96L-T101F 


Y96L-V396F 


Y96A-V247A 


Y96F-T101L 


Y96F-I395F 


Y96L-L244A 


Y96L-V396W 


Y96A-V247L 


Y96F-T101F 


Y96F-I395G 


Y96L-L244F 




Y96A-I395F 


Y96F-T185A 


Y96F-V396A 


Y96L-V247A 




Y96A-X395G 


Y96F-T185F 


Y96F-V396L 


Y96L-V247I* 


Y96W-F87W 




Y96F-T185L 


Y96F-V396F 


Y96L-V247F 


Y96W-F98W 




Y96F-L244A 


Y96F-V396W 


Y96L-V247W 


Y96W-L244A 




Y96F-V247A 




Y96I.-G248I, 


Y96W-V247A 




Y96F-V247Ia 




Y96J>-V295li 


Y96W-V396A 




Y96F-G248L 




Y96!L-V295F 





Triple Mutants : 
Y9 6A-F8 7A-I*2 4 4A 
Y96A-F87A-V247A 
Y96A-r87L-L244A 
Y96A-F87L-V247A 
Y96A-L244A-V247A 

Y96I^-F87A-I*244A 

Y96MS'87A-V247A 

Y96li-F87I--L244A 

Y96L-F87I*~V247A 

Y96Ir-V247A-I395F 

Y96L-V247L-I395F 

Y96I*-V247L~I395G 

Y96L-Z-244A-V396L 

496X.-X244A-V396F 

Y9 GL-I*2 4 4A- V3 9 6W 

Y9 6I--I-24 4F-V39 6A 



Y9 V2 4 7 A-V3 9 61* 
Y9 6L-V247A-V396F 
Y96L-V247A-V396W 
Y96L-V247F-V396A 

Y9 6F-F87A-L244A 
Y96F-F87A-V247A 
Y96F-F87A-V247L 
Y96F-F87A-I395F 
Y96F-F87A-I395G 
Y96F-F87I.-V247A 
Y96F-F87Ii-V247L 
Y96F-F87L-I395F 
Y96F-F87W-T185A 
Y96F-F87W-T18SF 
Y96F-F87W-Tl85Ii 
Y96F-F87W-L244F 



Y96F-F87W-V247A 

Y96F-F87W-V247L 

Y96F-F87W-V247F 

Y96F-F87W-V295I- 

Y96F-F87W-A296I. 

Y9 6F-F8 7W-V3 9 6A 

Y9 6F-F8 7W-V3 9 61* 

Y96F-V247F-V396A 

Y96F-I£44A-V396L 

Y96F-X244A-V396F 

Y96F-I*244A-V396W 

Y96F-L244F-V396A 

Y96F-V247A-V396L 

Y9 6F-V247A-V39 6F 

Y96F-V247A-V396W 

Y96W-F87W-F98W 
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CLAIMS 

I Process for Oxidising a substrate which is a halo aromatic compound, which 
process comprises oxidising said substrate with a monooxygenase enzyme. 

2. Process according to claim 1 in which the enzyme comprises a substitution 
5 of an amino acid in the active site by an amino acid with a less polar side-chain. 

3. Process according to claim 2 in which the enzyme comprises one or more 
other amino acid substitutions in the active site. 

4. Process according to any one of the preceding claims in which the enzyme 

is 

10 (i) P450 cam , or 

(ii) a naturally occurring homologue of (i), or 

(iii) a mutant of (i) or (ii). 

5. Process according to claim 4 in which the enzyme is one in which amino 
acid 96 of P450 CJmii or the equivalent amino acid in a homologue, has been changed to an 

15 amino acid with a less polar side-chain. 

6. Process according to any one of the preceding claims in which the halogen 
is chlorine. 

7. Process according to any one of the preceding claims in which the aromatic 
compound is a benzene or biphenyl. 

20 8 - Process according to any one of the preceding claims in which the substrate 

has more than one halogen atom. 

9. Process according to claim 8 in which the substrate is 1, 2-dichlorobenzene, 
1, 2, 4- trichlorobenzene, S^-dichlorobiphenyl or 2,2 , ,4,5 > 5 , -pentachlorobiphenyl. 

10. Process according to claim 8 in which the substrate is 
25 pentachlorobenzene or hexachlorobenzene. 

I I Process according to any one of the preceding claims which is carried out in 
a cell that expresses: 

(a) an enzyme as defined in any one of claims 1 to 5; 

(b) an electron transfer reductase; and 
30 (c) an electron transfer redoxin. 

12. Process according to claim 1 1 in which: 

(b) is putidaretoxin reductase or a homologue; or a fragment thereof; and/or 
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(c) is putidaretoxin or a homologue; or a fragment thereof 

13. Process according to claim 1 1 or 12 wherein the cell is one in which the 
enzyme (a) does not naturally occur. 

14. Process according to any one of claims 1 1 to 13 wherein the cell is 

one which in its naturally occurring form is able to oxidise a substrate as defined in any 
one of claims 6 to 10. 

15. A cell as defined in claim 14. 

1 6. A transgenic animal or plant whose cells are as defined in any one of 
claims 1 1 to 14. 

17. Method of treating a locus contaminated with a substrate as defined 
in any one of claims 1 or 6 to 10 comprising contacting the locus with an enzyme as 
defined in any one of claims 1 to 5 or a cell as defined in any of claims 1 1 to 13, or an 
animal or plant as defined in claim 16. 

18. Process for selecting a mutant of an enzyme as defined in claim 1, 4(i) or 
4(ii) for its ability to oxidise a substrate as defined in claim 1, or any one of the claims 6 to 
10, which process comprises screening a library of said mutants for their oxidation effect 
on the substrate. 

19. Process, cell, animal, plant or method according to any one of claims 1 to 17 
wherein the enzyme is one that has been selected in a process according to claim 18. 
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O (54) Title: PROCESS FOR OXIDISING AROMATIC COMPOUNDS 
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£^ (57) Abstract: Process for oxidising a substrate which is a halo aromatic compound, which process comprises oxidising said sub- 

r**~ straie with a monooxygenase enzyme. The enzyme may be P450cam. The process may be carried out in cells, animals or plants. 
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SEQUENCE LISTING 

<110> Isis Innovation Limited 
<120> Process for oxidising aromatic compounds 
<130> N76277A PEJ 
5 <170> Patent In Ver. 2.1 
<210> 1 
<211> 1242 
<212> DNA 

<213> Pseudomonas putida 
10 <220> 

<221> CDS 

<222> (1),.(1242) 

<400> 1 

acg act gaa acc ata caa age aac gec aat ctt gec cct ctg cca ccc 48 

15 Thr Thr Glu Thr He Gin Ser Asn Ala Asn Leu Ala Pro Leu Pro Pro 
15 10 15 

cat gtg cca gag cac ctg gta ttc gac ttc gac atg tac aat ccg teg 96 
His Val Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro Ser 
20 25 30 

20 aat ctg tct gec ggc gtg cag gag gee tgg gca gtt ctg caa gaa tea 144 
Asn Leu Ser Ala Gly Val Gin Glu Ala Trp Ala Val Leu Gin Glu Ser 
35 40 45 

aac gta ccg gat ctg gtg tgg act cgc tgc aac ggc gga cac tgg ate 192 
Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp He 
25 50 55 60 

gec act cgc ggc caa ctg ate cgt gag gec tat gaa gat tac cgc cac 240 
Ala Thr Arg Gly Gin Leu He Arg Glu Ala Tyr Glu Asp Tyr Arg His 
65 70 75 80 



30 



ttt tec age gag tgc ccg ttc ate cct cgt gaa gec ggc gaa gee tac 
Phe Ser Ser Glu Cys Pro Phe He Pro Arg Glu Ala Gly Glu Ala Tyr 
85 90 95 



288 
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gac ttc att ccc acc teg atg gat ccg ccc gag cag cgc cag ttt cgt 336 
Asp Phe lie Pro Thr Ser Met Asp Pro Pro Glu Gin Arg Gin Phe Arg 
100 105 no 

gcg ctg gec aac caa gtg gtt ggc atg ccg gtg gtg gat aag ctg gag 384 
5 Ala Leu Ala Asn Gin Val Val Gly Met Pro Val Val Asp Lys Leu Glu 
115 120 125 

aac egg ate cag gag ctg gec tgc teg ctg ate gag age ctg cgc ccg 432 
Asn Arg lie Gin Glu Leu Ala Cys Ser Leu He Glu Ser Leu Arg Pro 
130 135 140 

10 caa gga cag tgc aac ttc acc gag gac tac gee gaa ccc ttc ccg ata 480 
Gin Gly Gin Cys Asn Phe Thr Glu Asp Tyr Ala Glu Pro Phe Pro He 
145 150 155 160 

cgc ate ttc atg ctg etc gca ggt eta ccg gaa gaa gat ate ccg cac 528 
Arg He Phe Met Leu Leu Ala Gly Leu Pro Glu Glu Asp He Pro His 
15 165 170 175 

ttg aaa tac eta acg gat cag atg acc cgt ccg gat ggc age atg acc 576 
Leu Lys Tyr Leu Thr Asp Gin. Met Thr Arg Pro Asp Gly Ser Met Thr 
180 185 190 

ttc gca gag gee aag gag gcg etc tac gac tat ctg ata ccg ate ate 624 
20 Phe Ala Glu Ala Lys Glu Ala Leu Tyr Asp Tyr Leu He Pro He lie 
195 200 205 

gag caa cgc agg cag aag ccg gga acc gac get ate age ate gtt gec 672 
Glu Gin Arg Arg Gin Lys Pro Gly Thr Asp Ala He Ser He Val Ala 
210 215 220 



25 aac ggc cag gtc aat ggg cga ccg ate acc agt gac gaa gee aag agg 
Asn Gly Gin Val Asn Gly Arg Pro He Thr Ser Asp Glu Ala Lys Arg 
225 230 235 240 



720 



4 
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atg tgt ggc ctg tta ctg gtc ggc ggc ctg gat acg gtg gtc aat ttc 768 
Met Cys Gly Leu Leu Leu Val Gly Gly Leu Asp Thr Val Val Asn Phe 
245 250 255 

etc age ttc age atg gag ttc ctg gee aaa age ccg gag cat cgc cag 816 
5 Leu Ser Phe Ser Met Glu Phe Leu Ala Lys Ser Pro Glu His Arg Gin 
260 265 270 

gag ctg ate gag cgt ccc gag cgt att cca gee get tgc gag gaa eta 864 
Glu Leu He Glu Arg Pro Glu Arg He Pro Ala Ala Cys Glu Glu Leu 
275 280 285 

10 etc egg cgc ttc teg ctg gtt gee gat ggc cgc ate etc ace tec gat 912 
Leu Arg Arg Phe Ser Leu Val Ala Asp Gly Arg lie Leu Thr Ser Asp 
290 295 300 

tac gag ttt cat ggc gtg caa ctg aag aaa ggt gac cag ate ctg eta 960 
Tyr Glu Phe His Gly Val Gin Leu Lys Lys Gly Asp Gin lie Leu Leu 
15 305 310 315 320 

ccg cag atg ctg tct ggc ctg gat gag cgc gaa aac gee tgc ccg atg 1008 
Pro Gin Met Leu Ser Gly Leu Asp Glu Arg Glu Asn Ala Cys Pro Met 
325 330 335 

cac gtc gac ttc agt cgc caa aag gtt tea cac acc acc ttt ggc cac 1056 
20 His Val Asp Phe Ser Arg Gin Lys Val Ser His Thr Thr Phe Gly His 
340 345 350 

ggc age cat ctg tgc ctt ggc cag cac ctg gec cgc egg gaa ate ate 1104 
Gly Ser His Leu Cys Leu Gly Gin His Leu Ala Arg Arg Glu He lie 
355 360 365 



25 



gtc acc etc aag gaa tgg ctg acc agg att cct gac ttc tec att gec 
Val Thr Leu Lys Glu Trp Leu Thr Arg He Pro Asp Phe Ser lie Ala 
370 375 380 



1152 
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ccg ggt gcc cag att cag cac aag age ggc ate gtc age ggc gtg cag 1200 
Pro Gly Ala Gin He Gin His Lys Ser Gly He Val Ser Gly Val Gin 
385 390 395 400 

gca etc cct ctg gtc tgg gat ccg gcg act ace aaa gcg gta 1242 
5 Ala Leu Pro Leu Val Trp Asp Pro Ala Thr Thr Lys Ala Val 
405 410 



<210> 2 
<211> 3150 
<212> DNA 
10 <213> Bacillus megaterium 
<220> 
<221> CDS 
<222> (D..C3150) 

<400> 2 

15 atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 48 

Met Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 

15 10 15 



aat tta ccg tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa 96 
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
20 20 25 30 



att gcg gat gaa tta gga gaa ate ttt aaa ttc gag gcg cct ggt cgt 144 
lie Ala Asp Glu Leu Gly Glu He Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 



gta acg cgc tac tta tea agt cag cgt eta att aaa gaa gca tgc gat 192 
25 Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu He Lys Glu Ala Cys Asp 
50 55 60 



gaa tea cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 240 
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 80 
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gat ttt gca gga gac ggg tta ttt aca age tgg acg cat gaa aaa aat 288 
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
85 90 95 

tgg aaa aaa gcg cat aat ate tta ctt cca age ttc agt cag cag gca 336 
5 Trp Lys Lys Ala His Asn He Leu Leu Pro Ser Phe Ser Gin Gin Ala 
100 105 110 

atg aaa ggc tat cat gcg atg atg gtc gat ate gec gtg cag ctt gtt 384 
Met Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val 
115 120 125 

10 caa aag tgg gag cgt eta aat gca gat gag cat att gaa gta ccg gaa 432 
Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu 
130 135 140 

gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480 
Asp Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn 
15 145 150 155 160 

tat cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca 528 
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr 
165 170 175 

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 576 
20 Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 190 

aat cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa 624 
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 

25 gat ate aag gtg atg aac gac eta gta gat aaa att att gca gat cgc 672 
Asp He Lys Val Met Asn Asp Leu Val Asp Lys He He Ala Asp Arg 
210 215 220 
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aaa gca age ggt gaa caa age gat gat tta tta acg eat atg eta aae 720 
Lys Ala Ser Gly Glu Gin Ser- Asp Asp Leu Leu Thr His Met Leu Asn 
225 230 235 240 

gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aae att cgc 768 
5 Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg 
245 250 255 

tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816 
Tyr Gin lie He Thr Phe Leu lie Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 

10 ctt tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 864 
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 

caa aaa gca gca gaa gaa gca gca cga gtt eta gta gat cct get cca 912 
Gin Lys Al a Al a Glu Glu Ala Ala Arg Val Leu Val Asp Pro Ala Pro 
15 290 295 300 

age tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aae 960 
Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 
305 310 315 320 

gaa gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca 1008 
20 Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 

325 330 335 

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 1056 
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 
340 345 350 



25 



gaa eta atg gtt 
Glu Leu Met Val 
355 



ctg att cct cag ctt cac cgt gat aaa aca att tgg 
Leu lie Pro Gin Leu. His Arg Asp Lys Thr lie Trp 
360 365 
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gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 1152 
61 y Asp Asp Val 61 u Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 1200 
5 Ala He Pro 61n His Ala Phe Lys Pro Phe 61y Asn 61y 61n Arg Ala 
385 390 395 400 

tgt ate ggt cag cag ttc get ctt cat gaa gca acg ctg gta ctt ggt 1248 
Cys lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 
405 410 415 

10 atg atg eta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg 1296 
Met Met Leu Lys His Phe Asp Phe 61 u Asp His Thr Asn Tyr Glu Leu 
420 425 430 

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 1344 
Asp lie Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 
15 435 440 445 

gca aaa teg aaa aaa att ccg ctt ggc ggt att cct tea cct age act 1392 
Ala Lys Ser Lys Lys He Pro Leu Gly Gly lie Pro Ser Pro Ser Thr 
450 455 460 

gaa cag tct gec aaa aaa gca cgc aaa aag gca gaa aac get cat aat 1440 
20 Glu Gin Ser Ala Lys Lys Ala Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 480 

acg ccg ctg ctt gtg eta tac ggt tea aat atg gga aca get gaa gga 1488 
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 
485 490 495 
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acg gcg cgt gat 
Thr Al a Arg Asp 
500 



tta gca gat att gca atg age aaa gga ttt gca ccg 
Leu Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro 
505 510 
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cag gtc gca acg ctt gat tea cac gec gga aat ctt ccg cgc gaa gga 1584 
Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 
515 520 525 

get gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac 1632 
5 Ala Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 

gca aag caa ttt gtc gac tgg tta gac caa gcg tct get gat gaa gta 1680 
Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 560 

10 aaa ggc gtt cgc tac tec gta ttt gga tgc ggc gat aaa aac tgg get 1728 
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 
565 570 575 

act acg tat caa aaa gtg cct get ttt ate gat gaa acg ctt gec get 1776 
Thr Thr Tyr Gin Lys Val Pro Ala Phe lie Asp Glu Thr Leu Ala Ala 
15 580 585 590 

aaa ggg gca gaa aac ate get gac cgc ggt gaa gca gat gca age gac 1824 
Lys Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 1872 
20 Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

gta gca gec tac ttt aac etc gac att gaa aac agt gaa gat aat aaa 1920 
Val Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys 
625 630 635 640 



25 



tct act ctt tea ctt caa ttt 
Ser Thr Leu Ser Leu Gin Phe 
645 



gtc gac age gec gcg gat atg ccg ctt 
Val Asp Ser Ala Ala Asp Met Pro Leu 
650 655 
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gcg aaa atg cac ggt gcg ttt tea acg aac gtc gta gca age aaa gaa 2016 
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 

ctt caa cag cca ggc agt gca cga age acg cga cat ctt gaa att gaa 2064 
Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu 
675 680 685 



ctt cca aaa gaa get tct tat caa gaa gga gat cat tta ggt gtt att 2112 
Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie 
690 695 700 



10 cct cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc 2160 
Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 720 



eta gat gca tea cag caa ate cgt ctg gaa gca gaa gaa gaa aaa tta 2208 
Leu Asp Ala Ser Gin Gin He Arg Leu Glu Ala Glu Glu Glu Lys Leu 
15 725 730 735 



get cat ttg cca etc get aaa aca gta tec gta gaa gag ctt ctg caa 2256 
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 



tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 2304 
20 Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 



get get aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gec ttg 2352 
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

25 ctt gaa aag caa gee tac aaa gaa caa gtg ctg gca aaa cgt tta aca 2400 
Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 800 
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atg ctt gaa ctg ctt *gaa aaa tac ccg gcg tgt gaa atg aaa ttc age 2448 
Met Leu Glu Leu Leu Glu Lys* Tyr Pro Ala Cys Glu Met Lys Phe Ser 
805 810 815 

gaa ttt ate gec ctt ctg cca age ata cgc ccg cgc tat tac teg att 2496 
5 Glu Phe He Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser He 
820 825 830 

tct tea tea cct cgt gtc gat gaa aaa caa gca age ate acg gtc age 2544 
Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr Val Ser 
835 840 845 

10 gtt gtc tea gga gaa gcg tgg age gga tat gga gaa tat aaa gga att 2592 
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He 
850 855 860 



15 



gcg teg aac tat ctt gec gag ctg caa gaa gga gat acg att acg tgc 
Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys 
865 870 875 880 



2640 



ttt att tec aca ccg cag tea gaa ttt acg ctg cca aaa gac cct gaa 
Phe He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
885 890 895 



2688 



acg ccg ctt ate atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga 
20 Thr Pro Leu He Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 
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ggc ttt gtg cag gcg cgc aaa cag eta aaa gaa caa gga cag tea ctt 
Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 



2784 



25 gga gaa gca cat tta tac ttc ggc tgc cgt tea cct cat gaa gac tat 
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 
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ctg tat caa gaa gag ctt gaa aac gcc caa age gaa ggc ate att acg 2880 
Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie He Thr 
945 950 955 960 

ctt cat acc get ttt tct cgc atg cca aat cag ccg aaa aca tac gtt 2928 
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
965 970 975 

cag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 2976 
Gin His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp 
980 985 990 

caa gga gcg cac ttc tat att tgc gga gac gga age caa atg gca cct 3024 
Gin Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

gcc gtt gaa gca acg ctt atg aaa age tat get gac gtt cac caa gtg 3072 
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

agt gaa gca gac get cgc tta tgg ctg cag cag eta gaa gaa aaa ggc 3120 
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 1035 1040 



cga tac gca aaa gac gtg tgg get ggg taa 
Arg Tyr Ala Lys Asp Val Trp Ala Gly 

1045 1050 



3150 



N.76277B 



I hereby certify that this correspondence is being deposited with the U.S. 
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DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

As below named inventor(s), I/we hereby declare that 
This declaration is of the following type: 

j j original j" j design [ j supplemental 

[x] national stage of PCT 

j^J divisional ! 1 continuation j j ccntinuation-in-part 

My/our residence, post office address and citizenship are as stated below next to my/our name. 

I/we believe I/we am/are the original, first and sole inventor (if only one name is listed below) or an original, first 
and joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a patent is 
sought on the invention entitled: 

PROCESS FOR OXIDISING AROMATIC COMPOUNDS ^ 

the specification of which (check one) 

| | is attached hereto 
j | was filed on 

in the United States Patent and Trademark Office as Application Serial No. 
and was amended on (if applicable) 

J was described and claimed in PCT International Application No. GBOO/02379^- 
filed on 19 Jun 2000 ^ 

and as amended under PCT Article 19 on (if any) 
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I/we acknowledge the duty to disclose information which is material to patent ability as defined in 

37 C.F.R. § 1.56. 

I/we hereby claim foreign priority benefits under 35 U.S.C §1 19(a)-(d) or § 365(b) of any foreign 
apphcation(s) for patent or inventor's certificate or under § 365(a) of any PCT International Application(s) which 
designated at least one country other than the United States of America, listed below and have also identified below 
any foreign application for patent or inventor's certificate or PCT International Application having a filing date 
before that of the application on which priority is claimed: 
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PRIOR FOREIGN/PCT APPLICATION(S) AND ANY PRIORITY CLAIMS UNDER 35 U.S.C. §119 



Application No. 


Country 


Filing Date 


Priority Claimed 
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No 


9914373.7 ' 


GB ^ 
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X 




GBOO/02379 




19Jun2000 ^ 







I/we hereby claim the benefit under 35 U.S.C. § 1 19(e) of any United States Provisional Application(s) listed below: 
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Application No. 


Filing Date 







I/we hereby claim the benefit under 35 U.S.C. § 120 of any United States Application(s) or § 365(c) of any PCT 
International Application(s) designating the United States of America, listed below and, insofar as the subject matter 
of each of the claims of this application is not disclosed in the prior United States or PCT International Application 
in the manner provided by the first paragraph of 35 U.S.C. § 1 12, I/we acknowledge the duty to disclose 
information which is material to patent ability as defined in 37 C.F.R. § 1.56 which became available between the 
filing date of the prior applicat ion and the national PCT international filing date of this application. 



PRIOR UNITED STATES/PCT INTERNATIONAL APPLICATION(S) 



Application No. 


Filing Date 


Status 
(patented, 
pending/abandoned) 


GBOO/02379 ^ 


19 Jun 2000 S 











I/we hereby appoint the following attorney (s) and/or agent(s) to prosecute this application and to transact all 
business in the U.S. Patent and Trademark Office connected therewith: 



Fulbright & Jaworski LLP, 1301 McKinnev Suj ff* 5100 fffMitfftffl T >xas 77010-3095 United States of America ^ 

I/we hereby declare that all statements made herein of my/our own knowledge are true and that all statements made 
on information and belief are believed to be true; and further that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine or imprisonment, or both, under 
18 U.S.C § 1001 and that such willful false statements may jeopardise the validity of the application or any patent 
issued thereon. 
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Citzenship 
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Luet Lok WONG 
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Date 
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Postal Service as Express Mail. Airbill No EU098493435US, in an envelope 
addressed to Box Sequence, Commissioner for Patents, Washington, DC 
20231, on the date shown below 



Dated. April 4, 2002 Signature 



(Linda A Bourg) q) 
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Docket No.: HO-P02353US1 
(PATENT) 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Patent Application of: 
Luet L. Wong, et ah 

Application No.: 10/018,730 Group Art Unit: N/A 

Filed: December 18, 2001 

Intl Appln. No.: PCT/GB00/02379 Examiner: Not Yet Assigned 

Filed: June 19, 2000 

For: PROCESS FOR OXIDISING AROMATIC 
COMPOUNDS. 



RESPONSE TO NOTICE TO COMPLY WITH REQUIREMENTS 

FOR PATENT APPLICATION CONTAINING NUCLEOTIDE 
SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

Box Sequence 

Commissioner for Patents 
Washington, DC 20231 

Dear Sir: 

This is in response to the Notice To Comply With Requirements For Patent 
Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures, 
dated February 4, 2002. Applicants are submitting their response herewith within the two- 
month response period set to expire on April 4, 2002. 

The U.S. Patent and Trademark Office has taken the position that the previous 
"Sequence Listing" in computer readable form did not comply with the requirements of 
Section 1 .824 for the stated reason that it contained several raw sequence listing errors. 
Submitted herewith in connection with the above-referenced patent application is a substitute, 
computer readable copy of the Sequence Listing that corrects the defects noted in the raw 
sequence listing. No new matter has been added to the substitute Sequence Listing. 



Application No.: 10/018,730 



Docket No.: HO-P02353US1 



Applicants respectfully request the above-referenced patent application be placed 
upon the files for examination. 



If any fees are due in this matter, please charge our Deposit Account, No. 06-2375, 
under Order No. 101 12404 from which the undersigned is authorized to draw. 



Dated: April 4, 2002 Respectfully submitted, 

By ^jU^t&A Q <AL Jsf — 
Elizabeth A.^Hart, Register/d latent Agent 

Registration No.: 50,93 1 
FULBRIGHT & JAWORSKI L.L.P. 
1301 McKinney, Suite 5100 
Houston, Texas 77010-3095 
(713) 651-5698 
(713) 651-5246 (Fax) 
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SEQUENCE LISTING 

<110> Luet, Wong 

Jonathan, Jones 

<120> PROCESS FOR OXIDISING AROMATIC COMPOUNDS 

<130> P02353US1 / 10112404 / N.76277B 

<140> US 10/018,730 

<141> 2001-12-18 

<150> GB 9914373.7 

<151> 1999-06-18 

<150> PCT/GB0 0/02 37 9 

<151> 2000-06-19 

<160> 18 

<170> Patentln version 3.1 

<210> 1 

<211> 1242 

<212> DNA 

<213> Pseudomonas putida 

<400> 1 



acgactgaaa 


ccatacaaag 


caacgccaat 


cttgcccctc 


tgccacccca 


tgtgccagag 


60 


cacctggtat 


tcgacttcga 


catgtacaat 


ccgtcgaatc 


tgtctgccgg 


cgtgcaggag 


120 


gcctgggcag 


ttctgcaaga 


atcaaacgta 


ccggatctgg 


tgtggactcg 


ctgcaacggc 


180 


ggacactgga 


tcgccactcg 


cggccaactg 


atccgt gagg 


cctatgaaga 


ttaccgccac 


240 


ttttccagcg 


agtgcccgtt 


cat ccct cgt 


gaagccggcg 


aagcctacga 


cttcattccc 


300 


acctcgatgg 


atccgcccga 


gcagcgccag 


ttt cgtgcgc 


tggccaacca 


agtggttggc 


360 


atgccggtgg 


tggataagct 


ggagaaccgg 


atccaggagc 


tggcctgctc 


gctgat cgag 


420 


agcctgcgcc 


cgcaaggaca 


gtgcaacttc 


accgaggact 


acgccgaacc 


cttcccgata 


480 


cgcatcttca 


tgctgctcgc 


aggtctaccg 


gaagaagata 


tcccgcactt 


gaaataccta 


540 


acggat caga 


tgacccgtcc 


ggatggcagc 


atgaccttcg 


cagaggccaa 


ggaggcgctc 


600 


tacgactatc 


tgataccgat 


catcgagcaa 


cgcaggcaga 


agccgggaac 


cgacgctatc 


660 


agcatcgttg 


ccaacggcca 


ggtcaatggg 


cgaccgatca 


ccagtgacga 


agccaagagg 


720 


atgtgtggcc 


tgttactggt 


cggcggcctg 


gatacggtgg 


tcaatttcct 


cagcttcagc 


780 


atggagttcc 


tggccaaaag 


cccggagcat 


cgccaggagc 


tgatcgagcg 


tcccgagcgt 


840 


attccagccg 


cttgcgagga 


actact ccgg 


cgcttctcgc 


tggttgccga 


tggccgcatc 


900 


ctcacct ccg 


att acgagtt 


tcatggcgtg 


caactgaaga 


aaggtgacca 


gatcctgcta 


960 


ccgcagatgc 


tgt ctggcct 


ggatgagcgc 


gaaaacgcct 


gcccgatgca 


cgtcgacttc 


1020 
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agtcgccaaa aggtttcaca caccaccttt ggccacggca gccatctgtg ccttggccag 1080 

cacctggccc gccgggaaat catcgtcacc ctcaaggaat ggctgaccag gattcctgac 1140 

ttctccattg ccccgggtgc ccagattcag cacaagagcg gcatcgtcag cggcgtgcag 1200 

gcactccctc tggtctggga tccggcgact accaaagcgg ta 1242 

<210> 2 
<211> 414 
<212> PRT 

<213> Pseudomonas putida 
<400> 2 

Thr Thr Glu Thr lie Gin Ser Asn Ala Asn Leu Ala Pro Leu Pro Pro 
15 10 15 



His Val Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro Ser 
20 25 30 



Asn Leu Ser Ala Gly Val Gin Glu Ala Trp Ala Val Leu Gin Glu Ser 
35 40 45 



Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp lie 
50 55 60 



Ala Thr Arg Gly Gin Leu lie Arg Glu Ala Tyr Glu Asp Tyr Arg His 
65 70 75 80 



Phe Ser Ser Glu Cys Pro Phe lie Pro Arg Glu Ala Gly Glu Ala Tyr 
85 90 " 95 



Asp Phe lie Pro Thr Ser Met Asp Pro Pro Glu Gin Arg Gin Phe Arg 
100 105 110 



Ala Leu Ala Asn Gin Val Val Gly Met Pro Val Val Asp Lys Leu Glu 
115 120 125 



Asn Arg lie Gin Glu Leu Ala Cys Ser Leu lie Glu Ser Leu Arg Pro 
130 135 140 



Gin Gly Gin Cys Asn Phe Thr Glu Asp Tyr Ala Glu Pro Phe Pro lie 
145 150 155 160 



Arg lie Phe Met Leu Leu Ala Gly Leu Pro Glu Glu Asp lie Pro His 
165 170 175 



Leu Lys Tyr Leu Thr Asp Gin Met Thr Arg Pro Asp Gly Ser Met Thr 
180 185 " 190 



Phe Ala Glu Ala Lys Glu Ala Leu Tyr Asp Tyr Leu lie Pro lie lie 
195 200 205 



Glu Gin Arg Arg Gin Lys Pro Gly Thr Asp Ala lie Ser lie Val Ala 
210 215 220 



Asn Gly Gin Val Asn Gly Arg Pro lie Thr Ser Asp Glu Ala Lys Arg 
225 230 235 240 



Met Cys Gly Leu Leu Leu Val Gly Gly Leu Asp Thr Val Val Asn Phe 

245 250 255 



Leu Ser Phe Ser Met Glu Phe Leu Ala Lys Ser Pro Glu His Arg Gin 
260 265 270 



Glu Leu lie Glu Arg Pro Glu Arg lie Pro Ala Ala Cys Glu Glu Leu 

275 280 285 



Leu Arg Arg Phe Ser Leu Val Ala Asp Gly Arg lie Leu Thr Ser Asp 
290 295 300 



Tyr Glu Phe His Gly Val Gin Leu Lys Lys Gly Asp Gin lie Leu Leu 
305 310 315 320 



Pro Gin Met Leu Ser Gly Leu Asp Glu Arg Glu Asn Ala Cys Pro Met 
325 330 335 



His Val Asp Phe Ser Arg Gin Lys Val Ser His Thr Thr Phe Gly His 
34 0 34 5 350 



Gly Ser His Leu Cys Leu Gly Gin His Leu Ala Arg Arg Glu lie lie 
355 360 365 



Val Thr Leu Lys Glu Trp Leu Thr Arg lie Pro Asp Phe Ser lie Ala 
370 375 380 



Pro Gly Ala Gin He Gin His Lys Ser Gly He Val Ser Gly Val Gin 
385 390 395 400 



Ala Leu Pro Leu Val Trp Asp Pro Ala Thr Thr Lys Ala Val 
405 " 410 



<210> 3 
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<211> 3150 

<212> DNA 

<213> Bacillus megaterium 

<400> 3 



atgacaatta 


aagaaatgee 


teagecaaaa 


acgtttggag 


agcttaaaaa 


tttaccgtta 


60 


ttaaacacag 


ataaaceggt 


tcaagctttg 


atgaaaattg 


eggatgaatt 


aggagaaatc 


120 


tttaaatt eg 


aggegectgg 


tcgtgtaacg 


cgctacttat 


caagtcagcg 


tctaattaaa 


180 


gaagcatgcg 


at gaat cacg 


ctttgataaa 


aacttaagtc 


aagcgcttaa 


atttgtacgt 


240 


gattttgeag 


gagaegggtt 


atttacaagc 


tggacgcatg 


aaaaaaattg 


gaaaaaagcg 


300 


cataatatct 


tacttccaag 


cttcagt cag 


caggcaatga 


aaggctatca 


tgcgatgatg 


360 


gtcgatatcg 


ccgtgcagct 


tgttcaaaag 


tgggagcgtc 


taaatgeaga 


tgagcatatt 


420 


gaagtacegg 


aagacatgac 


acgtttaacg 


cttgatacaa 


ttggtctttg 


eggctttaac 


480 


tategcttta 


acagctttta 


ccgagatcag 


cct catccat 


ttattacaag 


tatggtccgt 


540 


gcactggatg 


aagcaatgaa 


caagctgeag 


cgagcaaatc 


cagacgaccc 


agcttatgat 


600 


gaaaacaagc 


gecagtttea 


agaagatatc 


aaggtgatga 


acgacctagt 


agataaaatt 


660 


attgeagate 


gcaaagcaag 


cggtgaacaa 


agegatgatt 


tattaacgea 


t atgetaaac 


720 


ggaaaagatc 


cagaaaeggg 


tgagecgett 


gatgacgaga 


acatt cgcta 


tcaaattatt 


780 


acattcttaa 


ttgegggaca 


cgaaacaaca 


agtggtcttt 


tatcatttgc 


gctgtatttc 


840 


ttagtgaaaa 


atccacatgt 


attacaaaaa 


gcagcagaag 


aagcagcacg 


agttctagta 


900 


gatcctgctc 


caagctacaa 


acaagtcaaa 


cagcttaaat 


atgtcggcat 


ggtcttaaac 


960 


gaagcgctgc 


gettatggee 


aactgctcct 


gcgttttccc 


tatatgeaaa 


agaagatacg 


1020 


gtgcttggag 


gagaatatcc 


tttagaaaaa 


ggegacgaac 


taatggttct 


gattcctcag 


1080 


cttcaccgtg 


ataaaacaat 


ttggggagac 


gatgtggaag 


agttccgtcc 


agagcgtttt 


1140 


gaaaatccaa 


gtgegattec 


geagcatgeg 


tttaaaccgt 


ttggaaacgg 


tcagcgtgcg 


1200 


tgtatcggtc 


ageagttege 


tcttcatgaa 


geaaegctgg 


tacttggtat 


gatgetaaaa 


1260 


cactttgact 


ttgaagatca 


tacaaactac 


gagctggata 


ttaaagaaac 


tttaacgtta 


1320 


aaacctgaag 


gctttgtggt 


aaaagcaaaa 


tcgaaaaaaa 


ttccgcttgg 


eggtattect 


1380 


t cacct agca 


ctgaacagtc 


tgccaaaaaa 


geaegcaaaa 


aggcagaaaa 


cgctcataat 


1440 


acgccgctgc 


ttgtgctata 


cggttcaaat 


atgggaacag 


ctgaaggaac 


ggcgcgtgat 


1500 


ttagcagata 


ttgeaatgag 


caaaggattt 


gcaccgcagg 


tcgcaacgct 


tgattcacac 


1560 


geeggaaate 


ttccgcgcga 


aggagctgta 


ttaattgtaa 


eggegtctta 


taaeggtcat 


1620 


ccgcctgata 


aegcaaagea 


atttgtcgac 


tggttagacc 


aagcgtctgc 


tgatgaagta 


1680 
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^ « 4— 4— , 

dddygcgLLC 


gctactccgt 


at t t ggatgc 


ggegat aaaa 


act gggct ac 


t a cgt at caa 


17 4 0 


aaagt gcct g 


cttttatcga 


t gaaacget t 


gecget aaag 


gggcagaaaa 


catcgctgac 


loUU 


eg egg t gaag 


cagat gcaag 


cgacgacttt 


gaaggcacat 


at gaagaatg 


gcgtgaacat 


18 60 


at gtggagt g 


aegtagcage 


ctactttaac 


ctcgacat t g 


aaaacagt ga 


agataat aaa 


192 0 


LCLdCLCLLL 


cacttcaatt 


t gt cgacagc 


gecgeggat a 


tgccgcttgc 


gaaaatgeae 


198 0 


rrrr +~ /~« /t -I- 4— -t- +- 

yy LijLy l l ll 


uddcy dduy L 


egtagcaage 


aaagaacttc 


aacagecagg 


cagtgeaega 


on/in 
Z U 4 U 


dytdC-(jcgdc 


4— 4— 4— /-w- 3 — ^ — \ 4— 

aLLLLgdddL 


cgaacL icca 


aaagaagctt 


cttatcaaga 


aggagat cat 


ZlUU 


ttaggtgtta 


ttcctcgcaa 


ctatgaagga 


at agt aaacc 


gtgt a a cage 


aaggt t egge 


zlbU 


CLdydLyCdL 


CdCdgCdddL 


ccgtctggaa 


gcagaagaag 


aaaaattagc 


t catt t geca 


o o o n 
ZZZU 


"I - Z"" 1 /~T /^~» f- "Zl "11 

CtcgCtdadd 


cdgLdLCcgi 


agaagagctt 


ctgcaatacg 


tggagcttca 


agat cct gt t 


o o o n 
ZZoV 


cAoyoyoa.c.yo 


agcttcgcgc 


aatggctgct 


aaaaeggtet 


gcccgccgca 


t aaagt a gag 


Z J) 4 U 


ettgaagect 


tgcttgaaaa 


gcaagcctac 


aaagaacaag 


tgctggcaaa 


aegtt t aaca 


Z 4 UU 


atgettgaac 


tgcttgaaaa 


at acccggcg 


t gt gaaatga 


aatt cagega 


at tt at cgee 


2 4 60 


cttctgccaa 


gcatacgccc 


gegctattae 


4--—./-T— .4-4-4-,— ,4-4- 

tcgatttctt 


cat cacct eg 


tgt cgatgaa 


O c: O ^ 


aaacaagcaa 


geatcaeggt 


cagcgttgt c 


t caggagaag 


cgt ggagegg 


atatggagaa 


2580 


tataaaggaa 


tt gcgt cgaa 


ctatcttgcc 


gagct gcaag 


aaggagat ac 


gattacgtgc 


2 64 0 


tttatttcca 


caccgcagt c 


agaattt acg 


ct gecaaaag 


accctgaaac 


gccgcttatc 


27 00 


4— j~X r~X +- /— . /— j—i 

ci Ly y Luggdc 


egggaacagg 


cgtcgcgccg 


t t t agaggct 


ttgtgcaggc 


gcgcaaacag 


O ^7 r\ 

Z / o(J 


ctaaaagaac 


aaggacagtc 


act t ggagaa 


gcacatt t at 


act t egget g 


c cgt t cacct 


O O O 

z oz U 


catgaagact 


atctgtatca 


agaagagctt 


gaaaacgccc 


aaagegaagg 


cat cat t acg 


2 8 8 0 


cttcataccg 


ctttttctcg 


catgecaaat 


cagecgaaaa 


cataegttea 


geaegtaatg 


2940 


gaacaagacg 


gcaagaaatt 


gattgaactt 


cttgatcaag 


gagegcaett 


etatatttge 


3000 


ggagacggaa 


gecaaatgge 


acctgccgtt 


gaagcaaege 


ttatgaaaag 


etatgetgae 


3060 


gttcaccaag 


tgagtgaagc 


agacgctcgc 


ttatggctgc 


agcagctaga 


agaaaaaggc 


3120 


egataegcaa 


aagacgtgtg 


ggctgggtaa 








3150 



<210> 4 
<211> 1049 
<212> PRT 

<213> Bacillus megaterium 
<400> 4 

Met Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 
15 10 15 
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Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
2 0 2 5 30 



lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 



Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 



Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 80 



Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
85 90 95 



Trp Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala 
100 105 110 



Met Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val 
115 120 125 



Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu 
130 135 140 



Asp Met Thr Arg Leu Thr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn 
145 150 155 160 



Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr 
165 170 175 



Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 
180 185 ^ 190 



Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 



Asp lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg 
210 215 220 



Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 
225 230 235 240 



Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn lie Arg 
245 250 " 255 
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Tyr Gin lie lie Thr Phe Leu lie Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 



Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 



Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Ala Pro 
290 295 300 



Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 
305 310 315 320 



Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 

325 330 335 



Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 
340 345 350 



Glu Leu Met Val Leu lie Pro Gin Leu His Arg Asp Lys Thr lie Trp 
355 360 365 



Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 



Ala lie Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala 
385 390 395 400 



Cys lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 
405 410 415 



Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 
420 425 430 



Asp lie Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 
4 35 44 0 4 45 



Ala Lys Ser Lys Lys lie Pro Leu Gly Gly lie Pro Ser Pro Ser Thr 
4 50 4 55 4 60 



Glu Gin Ser Ala Lys Lys Ala Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 480 



Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 
4 85 4 90 4 95 



Thr Ala Arg Asp Leu Ala Asp lie Ala Met Ser Lys Gly Phe Ala Pro 
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500 505 510 



Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 
515 520 525 



Ala Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 



Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 560 



Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 
565 570 575 



Thr Thr Tyr Gin Lys Val Pro Ala Phe lie Asp Glu Thr Leu Ala Ala 
580 585 590 



Lys Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 



Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 



Val Ala Ala Tyr Phe Asn Leu Asp lie Glu Asn Ser Glu Asp Asn Lys 
625 630 635 640 



Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 
645 650 655 



Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 



Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu 
675 680 685 



Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val lie 
690 695 700 



Pro Arg Asn Tyr Glu Gly lie Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 720 



Leu Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu 
725 730 735 



Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 
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Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 



Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 ' 780 



Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 800 



Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
805 810 815 



Glu Phe lie Ala Leu Leu Pro Ser lie Arg Pro Arg Tyr Tyr Ser lie 
820 825 830 



Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser lie Thr Val Ser 
835 840 845 



Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly lie 
850 855 860 



Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr lie Thr Cys 
865 870 875 880 



Phe lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
885 890 895 



Thr Pro Leu lie Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 



Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 



Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 " 940 



Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr 
945 950 955 960 



Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
965 970 975 



Gin His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp 
9'80 985 990 



Gin Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 



Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin 

1010 1015 1020 

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu 

1025 1030 1035 

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 

1040 1045 
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51 
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<213> 


Artificial sequence 


<220> 




<223> 


Primer 


<220> 




<221> 
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<222> 


(1) . . (51) 


<223> 





<400> 5 

gagattaaga attcataaac acatgggagt gcgtgccata tgaacgcaaa c 



<210> 


6 


<211> 


7 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Linker 


<220> 




<221> 


MISC FEATURE 
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<223> 





<400> 6 

Thr Asp Gly Gly Ser Ser Ser 
1 " 5 



<210> 7 

<211> 36 

<212> DNA 

<213> Artificial sequence 



<220> 



<223> Desired coding sequence 
<220> 

<221> misc_f eature 

<222> (1) . . (36) 
<223> 

<400> 7 

gaactgagta gtgccactga cggaggatcc tcatcg 
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Artificial sequence 


<220> 
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Thr Asp Gly Gly Ser 
1 5 
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DNA 
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<223> 


Primer 


<220> 
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inisc feature 
(1) - • (36) 


<400> 


9 



cgatgaggat cctccgtcag tggcactact cagttc 

<210> 10 

<211> 36 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Desired coding sequence 



<220> 
<221> 
<222> 



mi sc_f eature 
(1) . . (36) 



<223> 



<400> 10 

tcatcgggat cctcatcgat gtctaaagta gtgtat 



<210> 


11 


<211> 


4 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Desired coding sequence 


<220> 




<221> 


MISC FEATURE 


<222> 


(1) - • (4) 


<223> 




<400> 


11 


Gly Ser Ser Ser 


1 




<210> 


12 


<211> 


52 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Desired coding sequence 


<220> 




<221> 


misc feature 


<222> 


(1) . • (52) 


<223> 





<400> 12 

cccgataggc aatggtaatc atcgggagtc tagagcatcg aagctttcat eg 



<210> 


13 


<211> 


52 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Primer 


<220> 




<221> 


misc feature 


<222> 


(1) . . (52) 


<223> 




<400> 


13 



cgatgaaagc ttcgatgetc tagactcccg atgattacca ttgectateg gg 
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<210> 14 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(1) • • (24) 



<400> 14 

tcatcgacgc gtcgcgaact gctg 



24 



<210> 15 

<211> 69 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Desired coding sequence 



<220> 
<221> 
<222> 
<223> 



mi scf eature 
(1) • - (69) 



<400> 15 

cccgataggc aatggtaagt aggtgaatat ctaatcccca tctatgcgcg agtggagtct 60 
agagttcga 69 



<210> 16 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 



<220> 
<221> 
<222> 
<223> 



mi sc_f eature 
(1) . . (21) 



<400> 16 

tcatcgaagc ttggctgttt t 



21 



<210> 
<211> 
<212> 



17 
47 
DNA 



<213> 



Artificial sequence 



<220> 

<223> Desired coding sequence 
<220> 

<221> misc_f eature 
<222> (1) . . (47) 
<223> 



<400> 17 

acaatttcac acaggatcta gaccatatgt catcgaagct ttcatcg 



<210> 18 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<220> 

<221> miscf eature 

<222> (1) . . (47) 
<223> 



<400> 18 

cgatgaaagc ttcgatgaca tatggtctag atcctgtgtg aaattgt 



